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Expression in Escherichia coli of a Chemically Synthesized 
Gene for the Hormone Somatostatin 

Abstract. A gene for somaloxlutin, n mammalian peptide (14 amino acid residiie.\) 
hormone, wa.s synthesized by chemical melhod.s. Thi.s f^etie was fused to the Esche- 
richia coli f3-^alactosid(ise gene on tlie pUixmid pBR322. Transformation ofE. coli 
with ihi^ chimeric plasmid DNA led to the synlhesi.s of a polypeptide including the 
sequence of amino acids corresponding to somatostatin. In vitro, active somato.sta- 
tin was .specifically cleaved from the large chimeric protein by treatment with cy- 
anogen bromide. This represe.nt.% the first synthesis of a funcliuiud polypeptide prod- 
uct frotn a gene of chemically synthesized origin. 



The chemical synthesis of DNA and 
recotnbinant DNA methods pcovide the 
technology for the design and synthesis 
of genes that can be fused to plasmid ele- 
ments for expression in Escherichia coli 
or other bacteria. As a model system we 
have designed and synthesii:cd a gene for 
the small polypeptide hormone, soma- 
tostatin (Figs. 1 and 2). The major con- ■ 
sideration*, in the choice of this hormone 
were its small %\7.t and known amino acid 
secitience (/), sensitive radioimmune and 
biological assays (2), and its intrinsic bio- 
logical interest [3). Somatostatin is a tet- 
radecapeptide; it was originally discov- 
ered in ovine hypothalamic extracts but 
subsequently was also found in signifi- 
cant quantities in other species and other 
tissues (i). Somatostatin inhibits the se- 
cretion of a number of hormones, includ- 
ing growth hormone, insulin, and gluca- 
gon. The effect of somatostatin on the se- 
cretion of these horinoncs has attracted 
attention to its potential therapeutic val- 
ue in acromegaly, acute pancreatitis, and 
insulin-dependent diabetes. 

The overall constmction of the soma- 
tostatin gene and plasmid was designed 
lo result in the in vivo sytithesis of a pre- 
cursor form of somatostatin (see Fig. 1). 
The piccui"sor protein would not be ex- 
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pected to have biological activity, but 
could be converted to a functional form : 
by cyanogen bromide cleavage (<?) after 
cellular extraction. The synthetic soma- 
tostatin gene was fused lo the lac operon 
because the controlling sites of this oper- 
on are well characterized. 

Given the amino acid sequence of 
somatostatin, one can design from the 
genetic code a short DNA fragment con- 
taining the information for its 14 ammo : 
acids (Fig, 2), The degeneracy of the 
code allows for a large number of pos- 
sible sequences that could code for the 
same 14 amino acids. Therefore, the 
choice of codons was somewhat arbi- 
trary except for the following restiit- 
tions. First, amino acid codons known to 
be favored in E. coli for expression of the 
MS2 genome were used where appropii- 
ate (5). Second, since the complete se- 
quence would be constructed from J 
number of overlapping fragments, the 
fragments were designed to eliminate un- 
desirable inter- and intramolecular paif- 
ing. And third, GC-rich (guanine-cyto- 
sine) followed by A-T-rich (ademne- 
thymine) sequences were avoided since 
they might terminate transcription (6). 

Eight oligonucleotides, varying i" 
length from 1 1 to Id nucleotides, labeled . 
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'^im Hig- 2 as A through H, were synthe- 
^^i^ed by the triester method (7). In addi- 
^Hoti to the 14 cod on s for the structural 
ojdii'rrnation of somatostatin, several oth- 
ir features were built into the nucleotide 
t^equence. First, to facilitate insertion in- 
■,*4d plasmid DNA, the 5' ends have single- 
'^tnitided cohesive termini for the Eco RI 
'^nd Bam HI restriction endonucleases. 
I^etond, a methionine codon precedes 
'ibc normal NHa-terminal amino acid of 
- ;som;itostatin, and the COOH-terminal 
stodon is followed by two nonsense co- 
Hions. 

In the cloning and expression of the 
/svnthetic somatostatin gene we used two 
jjjdsmids. Each plasmid has an Eco RT 
'substrate site at a diflferent region of the 
'/J.galactosidase stiucttiral gene (see 
higs. 3 and 4). The insertion of the syn- 
'ithctic somatostatin DNA fragment into 
, (he Eco Rl sites of these plasmlds brings 
/ the expression of the genetic information 
in that fragment under control of the lac 
' operon controlling elements. After the 
„ mseilion of the somatostatin fragment 
' into these plasmids, translation should 
^ result in a somatostatin polypeptide pre- 
ceded either by ten amino acids 
(pSOMI) or by virtually the whole jS-ga- 
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Som 

Mef ■ aio ■ C3ly ■ Cys ■ Lys ■ Asn ■ Phe • Phe . 

S Trp 

S 
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|i-Gal Fragments + NHj • flia ■ Gly ■ Cys ■ Lys -asn ■ Phe ■ Ptie . 

s 

HO ■ Cys ■ Ser - Thr ■ Phe ■ Thr ' 

Active Somotostotin 

Fig. ]. Schematic outline of the experimental plan, The gene for somatostatin, made by chem- 
ical DNA synthesis, was fused to the E. coli jS-ealactosidasc gene on the plasmid pBR322. After 
transformation into E. coli, the chimeric plasmid directs the synthesis of a chimeric protein that 
can be specifically cleaved in vitro at methionine residues by cyanogen bromide to yield active 
mammaitan peptide hormone. 
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Fly 2. Chemical synthesis of the somatostatin I 
gene, (a) Eight oligodeoxy ribonucleotides, la- 
beled A through H, were synthesized by the DMTO "^-^p—Q- + H0~ 

modified triester method (7, 23). The codons | | ^g,^ 

are indicated, and their corresponding amino 0 0 

4Cids arc given. The eight fragments were de- 
.sitriied to have at least five nucleotide com- 
pienientary overlaps to ensure efficient joining 

by T4 DNA I igase. (b) Recent improvements 1 CI " U CI 

m the synthesis of fully protected ivimers, 
iwhich constitute codon blocks and are the 
Kliic units for building longer oligodeoxyri- 
txiniicleotides. With an excess of 1 (2 mmole), 
the coupling reaction with 2 ( I mmole) vi-ent 
dlniost to completion in 60 minutes with the 
aid of a powerful coupling reagent, 2,4,6- 
tmsopropyibenzenesulfonyl tetrazolide 

(TPSTe, 4 mmole) (2), The 5'-protecling ^0-.^ "•--n_r,- H0-. 
jjroup was removed with 2 percent benzene ° " 

sulfonic acid, and the S'-hydroxyl dimer 5 
could be separated from an excess of 3'- 
phnsphodiester monomer A by simple solvent 
extraction with aqueous NaHCOs solution in 
CHCi,. The fully protected trimer block was 
prepared successively from the i'-hydrojtyl 
dimer S. 1 (2 mmole), and TPSTe (4 mmole) 

and isolated by chromatography on silica gel {24). These improvements simplify the purifi- 
cntion step and lead to an increase in the overall yields of trimer blocks and to a decrease 

in the working lime by at least a factor of 2 (2/). The eight oligodeoxyribonucleotides then -p . , . ■ , 

were synthesized from the trimers by published procedures (7). The final products, after removal of all protecting groups, were purified by high^ 
, pressure liquid chromatography on Permaphase AAX (2,5). The purity of each oligomer was checked by homochromatogl-aphy on thin-layer 
Oh-^E-cellulose and also by gel electrophoresis in 20 percent acrylamide (slab) after labeling of the oligomers with [y-^^PlATP m the presence ol 
polynucleotide kinase. One major labeled product was obtained from each DNA fragment. 

1057 



0 
II 

-P-O" 
I 

0 



^0, 



■0^ 



OR 



TPSTe 



Trimer 



B = 

DMT' 



CI 



Protected Base 
4,4' -dimethoxytritjrl 
0 

-C-(0)-0ME 



ay 



-(aye 



' * DECEMBER TO? 



Board Assigned Page #1 769 



From C613) 991-5727 



Order # 06672290DP05570366 Mon 29 Dan 2007 09:49:34 PM EST 



Page 4 



i 



ii 



r 



i 




I (J «3 



3l< 



H 

^\r- II I- 



IS 



< 



— Hi < (_ 



< 



WO S 



< 



105S 



Board Assigned Page #1.770 



SCIENCE, VOL. 



From C613) 991-5727 



Order # 06672290DP05570366 Mon 29 Dan 2007 09:49:34 PM EST 



Page 5 of 9 



(Tc) and ainpicillin (Ap) on 5-bi'omo-4- 
chloro-indolylgalacloside (X-gal) medi- 
itm UO). On this indicator medium, colo- 
nies constitutive for the synthesis of /3- 
galactosidase by virtue of the increased 
number of lac operators titrating repres- 
sor, are identified by their blue color. 
Two orientations of the Hae III fragment 
are possible, but these were distin- 
guished by the asymmetric location of an 
Hha restriction site in the fragment. Pias- 
mid pBH 10 was further modified to elim- 
inate the Eco Rl cndonuclease site distal 
to the lac operator (pBHZO). 

The eight chemically synthesized oli- 
godeoxyribonucleotides (Fig. 2) were la- 
beled at the 5' termini with [y-'^P]ATP 
(adenosine triphophatase) by T4 polynu- 
cleotide kinase and joined with T4 DNA 
ligase. Through hydrogen bonding be- 
tween the overlapping fragments, the 
somatostatin gene self-assembles and 
eventually polymerijes info larger mole- 
cules because of the cohesive restriction 
site termini. The ligated products were 



lactosidase subunit structure (pSOMll-3). 

The plasmid construction scheme 
(Jj'ig Ji) begins with plasmid pBR322, a 
Vifeli-chdracterized cloning vehicle {8). 
'^he lac elements were introduced to this 
jjlH'^fnid by insertion of an Hae III restric- 
|joD endonuclease fragment (203 nucle- 
otides! carrying the lac promoter, ca- 
ifeibolile-gene-activator-protein binding 
;Sjle, operator, ribosome binding site, and 
^ the first seven amino codons of the jS-ga- 
^^lattosid ise structural gene (9) (Figs. 3 
itid 1) The fiae III fragment was de- 
'fived from \plac5 DNA. The Eco Rl- 
vleaved pBR322 plasmid. which had its 
I tenmni Tcpaired with T4 DNA polymer- 
f JSC and deoxyribonucieotide triphos- 
* phaies was blunt-end ligated to the 
["V Jfac 111 fn^ment to create Eco RI termi- 
i pi at the insertion points. Joining of these 
Hfte 111 md repaired Eco RI termini gen- 
1 " eiatL the Eco RI restriction site (Figs. 3 
I f and 4} at each terminus, Transformants 
I ' ol £ Lc/li RRl (S) with this DNA were 
tf^ fckcted for resistance to tetracycline 
} 

Fig 3 (lacing page, left)- Construction of recombinant plasmids, Plas- 
inid pBR122 whs used as the parental plasmid (8), Plasmid DNA (5 jxg) 
was digested with the restriction endonuclease Eco Rl, The reaction 
^iwtis terminated by extraction with a mixture of phenoi and chloro- 
form" the DNA was precipitated with ethanol and resuspended in 50 
^1 of T4 DNA polymerase buffer (26). The reaction was started by the 
addition of 2 units of T4 DNA polymerase. The reaction (held for 30 
Sninutes at 37°C) was terminated by extraction with phenol and chloro- 
•feSnn and precipitation with ethanol. The Xplac.^ DNA (J ^ig) was di- 
ijjiested with the endonuclease Hsie HI (8). The digested pBR322 DNA 
:was blunt-end ligated with the Hae lll-digested \plac5 DNA in a final 
*i)lume of 30 fil with T4 DNA ligase (hydroxylapatite fraction) {27] in 
^SO (nW tris-HCl (pH 7.6), 10 mM MgCis, 10 dithiothrcitol, and 
0 *; inM ATP for 12 hours at 12''C- The ligated DNA mixture was dia- 
gyyed against 10 mM tris-HCl (pH 7.6) and used to transform E. coli 
'^trim RRl (S). Transformants were selected for tetracycline resist- 
;atice (Tc^) and ampicillin resistance (Ap') on antibiotic (20 /ig/ral) 
minimal X-gal (40 fig/ml) medium (10). Colonies constitutive J'or the 
Synthesis of jQ-galactosidase were identified by their blue color. After 
?45 independently isolated blue colonies were screened, three of (hem 
■mert found to contain plasmids with two Eco Rl sites separated by 
approvimately 200 base pairs (2S). Plasmid pBHlO was shown to carry 
fragment in the desired orientation, that is, lac transcription going 
i'tiito the I'c' gene of the plasmid. Plasmid pBHlO was further modified 
io eliminate the Eco R! site distal to the lac operator and plasmid 
■^pBH20 was obtained (29). The nucleotide sequence from the Eco Rl 
sue mto the lac-control region of pBH20 (data not shown), was con- 
i^firmcd. This plasmid was used for cloning the synthetic somatostatin 
gene. Plasmid pBH20 (10 fig) was digested with endonucleases 
Ejco Rl and Bam HI and treated with bacterial alkaline phosphatase 
(t) I unit of BAPF, Worthington), and incubation wiis continued for 10 
Ktninutes at 65^C. The reaction mixtures were extracted with a mixture 
(if phenol and chloroform, and the DNA was precipitated with ethanol 
(SO) ^Somatostatin DNA (50 /it of a solution containing 4 /^g/ml) was 
istigated with the Bam HI-Eco Rl, alkaline phosphatitse-treated 
joiJHZO DNA in a total volume of 50 fii with the use of 4 uiiits of T4 
DNA ligase for 2 hours at 22''C (31). In a control esperiment, 
Bam H!-Rco RI alkaline phosphatase-treated pBH20 DNA was ligat- 
ed m the absence of somatostatin DNA under similar conditions. Both 
pivpirations were used to transform £. coli RRl. Transformants were 
s-34lected on minimal X-gal antibiotic plates. Ten Tc" transformants 
:5«'cre isolated- In the control experiment no transformants were ob- 
tained. Four out of the ten transformants contained plasmids with 
holt! «n Ew Rl and a Bam HI site. The size of the small Eco RI- 
ifelin HI fragment of these recombinant plasmids was in all four in- 
■sfailces similar to the size of the in vitro prepared somatostatin DNA. 
-&ase sequence analysis (12) revealed that the plasmid pSOM! had the 



treated with Eco RI and Bam HI restric- 
tion endonucleases to generate the soma- 
tostatin gene (Fig. 21, 

The synthetic somatostatin gene frag- 
ment with Eco Rl and Bam H) termini 
was ligated to the pBHZO piasmid, pre- 
viously treated with the Hco RI and 
Bam HI restriction endonucleases and 
alkaline phosphatase. The treatment 
with alkaline phosphatase provides a 
moletular selection for plasmids car- 
rying the inserted fragment (7/). Ampi- 
ciilin-resistant transformants obtained 
with this ligated DNA were screened for 
tetracycline sensitivity, and several were 
examined for the insertion of an Eco Rl- 
Bam HI fragment of the appropriate 
size. 

Both strands of the Eco Rl-Bam HI 
fragments of plasmids from (wo clones 
were analyzed by a nucleotide sequence 
analysis (12) starting from the Bam HI 
and Eco RI sites. The sequence analysis 
was extended into the lac-controlling ele- 
ments; the lac fragment sequence was in- 
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desired somatostatin DNA fragment inserted (data not shown). Be- 
cause of the failure to detect somatostatin activity from cultures car- 
rying plasmid pSOMl , a plasmid was constructed in which the soma- 
tostatin gene could be located at the COOH-terminus of the p-gatac- 
tosidase gene, keeping the translation in phase. For the constriiclion 
of snch a plasmid. pSOMl {50 |^g) was digested with restriction en- 
zymes Eco RI and Pst 1. A preparative 5 percent polyacryiamide gel 
was used to separate the large Pst i-Eco Rl fragment that carries the 
somatostatin gene from the small fragment carrying the lac control 
elements (12). In a similar way plasmid pBR322 DNA (50 fj.g) was 
digested with Pst 1 and Eco Rl restriction endonucleases, and the two 
resulting DNA fragments were purified by preparative electrophoresis 
on a 5 percent polyacryiamide gel- The small Pst !-Eco Rl fragment 
from pBR322 (1 fig) was ligated with the large Pst I-Eco Rl DNA 
fragment (5 /xg) from pSOMi. The ligated mixture was used to trans- 
form £. coli RRl, and transformants were selected for Ap'' on X-gal 
medium. Almost all the Ap' transformants (95 percent) gave white 
colonies (no lac operator) on X-gal indicator plates. The resulting 
plasmid, pSOMI 1 , was used in the construction of plasmid pSOMl 1- 
3. A mixture of 5 fig of pSOMl 1 DNA and 5 fj,g of XplacS DNA was 
digested with Eco RI. The DNA was extracted with a mixture of phe- 
nol and chloroform; the extract was precipitated by ethanol, and the 
precipitate was resuspended in T4 DNA ligase buffer (50 isi) in the 
presence of T4 DNA ligase (1 unit). The ligated mixture was used to 
transform B. coli strain RRl. Transformants were selected for Ap' on 
X-gol plates containing ampicillin and screened for constitutive ^-ga- 
lactosidasc production- Approximately 2 percent of the colonies were 
blue (such as pSOMil-1 and 11-2). Restriction enzyme analysis of 
plasmid DNA obtained from these clones revealed that all the plas- 
mids carried a new Eco RI fragment of approximately 4.4 megadal- 
tons, which carries the lac operon control sites and most of (he fi- 
galactosidase gene (13. 14). Two orientations of the Eco RI fragment 
are possible, and the asymmetric location of a Hind 111 restriction in 
this fragment can indicate which plasmids had transcription pro- 
ceeding into the somatostatin gene. The clones carrying plasmids 
pSOMll-3, pSOMll-5, pSOMll-6, and pSOMll-? contained the 
Eco RI fragment in this orientation. 

Fig. 4 (facing page, right). Nucleotide sequences of the lac-somatosta- 
tin plasmids- The nucleotide sequence of the lac control elements, ji- 
galactosidase structural gene, and the synthetically derived soma- 
tostatin DNA, are depicted (9, 14, 27) along with the restriction endo- 
nuclease substrate sites. The nucleotide sequence of pSOMl, as de- 
picted, was confirmed (legends to Figs, 3 and 5). The nucleotide' se- 
quence of pSOMll-3 was inferred from published data (9, 13, 14, 27). 
The amino acid sequence of somatostatin is italicized. The amino acid 
sequence numbers of /S-galattosidase are in brackets. 
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tact, and in one case, pSOMI, the nucle- 
otide sequence of both strands were in- 
dependently determined, each giving the 
sequence shown in Fig. 3. In the other 
case, the sequence was identical except 
for a base pair deletion (A- 1) at a posi- 
tion equivalent to the junction of the B-C 
oligonucleotides in the original DNA 
fragment. The basis for the deletion is 
unclear. 

The standard radioimmune assays 
(Rf A) for somatostatin (2) were modified 
by decreasing the assay volume and by 
using phosphate buifer {Fig. 6). This 



modification proved suitable for the de- 
tection of somatostatin in E. coli ex- 
tracts. Bacterial cell pellets, extracts, or 
cultures were treated overnight in 70 per- 
cent formic acid containing cyanogen 
bromide {5 mg/ml). Formic acid and cy- 
anogen bromide were removed under 
vacuum over KOH before the assay. Ini- 
tial experiments with extracts of E. coli 
strain RRl (the recipient strain) {10) in- 
dicated thai less than 10 pg of somatosta- 
tin could easily be detected in the pres- 
ence of 16 fig or more of cyanogen bro- 
mide-treated bacteria! protein. More 




Fig 1 Ligation and acrylamide gel analysis «f somatostatin DNA. Ttie 5'-OH termini of the 
chemically synthesized fragments A through H (Fig. 2a) were labeled and phosphorylated sepa- 
rately. Just prior to the kinasf: reaction. 25 |UC of [7-="PlATP (~ 1500 c/mmole) (12) was evapo- 
rated to dryness in 0.5-ml Eppendorf tubes. The fragment (5 ng) was incubated with 2 units of 
14 DNA kinase (hydroxylapatite fractitin. 2500 unit/ml) {26), in 70 mM tris-HCl, pH 7.6, 10 mM 
MgCl,, and 5 mM dilhiothreitol in a total volume of 150 ^il for 20 minutes at 37°C. To ensure 
tnaximum phosphorylation of the fragments for ligation purposes, 10 ftl of a mixture consisting 
of 70 mM tris-lICl, pH 7.6, 10 mM MgCh, 5 m/W dilhiothreitol, 0.5 mM ATP, and 2 units of 
DNA kinase were added, and incubadon continued for an additional 20 minutes at 37°C, The 
fragments (250 ng/^il) were stored at -20°C without further treatment. Kinase-treated fragments 
A, B, E, and F (1. 25 fig each) wereligated in a total volume of 50fi,lin 20 mM tris-HCl (/?H 7.6), 
to mM MgCIa, 10 mM dithiothreitol, 0.5 mM ATP, and 2 units of T4 DNA hguse (hydroxyl- 
apatite fraction, 400 uniUml) {26), for 16 hours at 4°C. Fragments C, D, G, and H were ligated 
under similar conditions. Samples (2 /liI) were removed for analysis by electrophoresis on a 10 
percent polyacrylamide gel and subsequent autoradiography {16) (lanes i and 2, respectively). 
The fast migrating matej ial represents unreacted DNA fragments. Material migrating with the 
bromophenol blue dye (BPB) is the monomerie form of the ligated fragments. The slowest 
migrating material represents dimers, which form by virtue of the cohesive ends, of the ligated 
fragments A, B, E, and F (lane 1) and C, D, G, and H (lane 2). The dimers can be cleaved by 
restriction endonucleasc Eco RI or Bam HI, respectively (data not shown). The two half mole- 
culesdigated A + B + E + F and ligated C -t D -t- G + H) were joined by an additional liga- 
tion step carried out in a final volume of 150 ^xl at 4°C for 16 hours. A sample (1 u-l) was removed 
for analysis (lane 3). The reaction mixture was heated for t5 minutes at 65°C to inactivate the T4 
DNA ligase. The heat treatment does not affect the migration pattern of the DNA mixture (lane 
4), Enough restriction endoniiclease Bam HI was added to the reaction mixture to cleave the 
multimeric forms of the somatostatin DNA in 30 minutes at 37°C (lane 5). After the addition of 
NaCl to a concentration of 100 mM, the DNA was digested with Eco RI endonuclease (lane 6). 
The restriction endonucleasc digestions were terminated by phenol-chloroform extraction of 
the DNA. The somatostatin DNA fragment was purified from unreacted and partially ligated 
DNA fragments by preparative electrophoresis on a 10 percent polyacrylamide gel. The band 
indicated with an arrow (lane 7) was excised from the gel, and the DNA was eluted by slicing the 
gel into small pieces and extracting the DNA with elution buffer (0.5M ammonium acetate, 
10 mM MgCl^, 0.1 mM EDTA, and 0.1 percent sodiUEn dodecyl sulfate) overnight at 65°C (12). 
The DNA was precipitated with two volumes of elhanoi, centrifiiged, redissolved in 20(* /il of 
10 mM tris-HCl {pB 7.6), and diaiyzed against the same buffer, resulting in a somatostatin DNA 
concentration of 4 ftg/mt. 



than 2 fig of protein from formic acid- 
treated bacterial extracts inteifercd 
somewhat by increasing the background 
but cyanogen bromide cleavage greatly 
reduced this interference. Reconstruc- 
tion experiments showed that somatosta- ^ 
tin is stable in cyanogen bromide-treated ■ 
extracts. 

The DNA sequence analysis of 
pSOMl indicated that the clone carrying 
this plasmid should produce a peptide ^ 
containing somatostatin. However, to 
date all attempts to detect somatostatin 
radioimmune activity from extracts of 
cell pellets or culture supematants have 
been unsuccessful. Negative results 
were also obtained when the growing 
culture was added directly to 70 percent 
formic acid and cyanogen bromide. We 
calculate that £". coli RRl (pSOMl) con- 
tains less than six molecules of soma- 
tostalin per cell. In a reconstruction ex- 
periment we have observed that exoge- 
nous somatostatin is degraded very 
rapidly by £. coli RRl extracts. The latl- 
ure to find somatostatin activity might be 
accounted for by intracellular degrada- 
tion by endogenous proteolytic en- 
zymes. 

If the failure to detect somatostatin ac- 
tivity from pSOMl was due to pro- 
teolytic degradation of the small protem 
{Fig. 4), attachment to a large protein 
might stabilize il. The /3-galactosidase 
stRictifral gene has an Eco RI site near 
the COOH-tcrmlnus (13). The available 
data on the amino acid sequence of this 
protein {13, 14) suggested that it would 
be possible to insert the Eco Rl-Bani HI ; 
somatostatin gene into the site and main- 
tain the proper reading frame for the coi- 
rect translation of the somatostatin gene 
. (Fig. 4). 

The construction of this plasmid is 
outlined in Fig. 3. The Eco Rl-Pst frag- 
ment of the pSOMl plasmid, with the: 
lac-controlling element, was removed 
and replaced with the Eco Ri-Pst frag- 
ment of pBR322 to produce the plas- 
mid pSOMU. The Eco RI fragment ot 
XplacS, carrying the lac operon control 
region and most of the j8-galactosidase 
structural gene, was inserted into the 
EcoRl site of pSOMU. Two orienta 
tions of the Eco RI lac fragment of 
\piac5 were expected. One of these ori- 
entations would maintain the proper 
reading frame into the somatostatin 
gene, the other would not. 

A number of independently isolated 
clones {with plasmid designation^ 
pSOMll-2 and pSOMll-3) were anii- 
lyzed for somatostatin activity, as de- 
scribed above. In constrast to the results 
of experiments with pSOMl, four clones 
(pSOMll-3, 11-5, 11-6, and 11-7) were 
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"found to have easily detectable soma- 
j[C>statin radioimmune activity (Fig. 6, a 
aaJ b i. Rcslriction fragment antiiysis re- 

^veated that pSOMll-3, pSOMtl-5, 
pSOMll-6, and pSOM 11-7 had the de^ 
^iroti orientation of the lac operon, 

^ whereas pSOMll-2 and 11-4 had the op- 
jjosite orientation. Thus, there is a per- 

} feet correlation between the correct ori- 
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entation of the lac operon and the pro- 
duction of somatostatin radioimmune 
activity. 

The design of the somatostatin plas- 
mid predicts that the synthesis of soma- 
tostatin would be under the control of 
the tac operon. The lac repressor gene is 
not included in the piasmid, and the re- 
cipient strain (E. coli RRl) contains the 
wild-type chromosomal lac repressor 
gene, which produces only 10 to 20 re- 
pressor molecules per celt (15). The pias- 
mid copy number (and therefore the 
number of lac operators) is approximate- 
ly 20 to 30 per cell and complete repres- 
sion is impossible. The specific activity 
of somatostatin in E. coli RRl (pSOMl 1- 
3) was increased by IPTG, an inducer of 
the lac operon (Table 1). As expected, 
the level of induction was low, varying 
from 2.4- to 7-fo!d. In experiment 7 
(Tabic 1), the a activity {M}, a measure 
of the first 92 amino acids of /3-galactosi- 
dase, also was induced by a factor of 2. 



In several experiments (Table 1 and oth- 
er experiments not shown), no soma- 
tostatin radioimmune activity was de- 
lected prior to cyanogen bromide cleav- 
age of the total cellular protein. Since the 
antiserum used in the radioimmune as- 
say, S39, requires a free NHg-terminal 
alanine, no activity was expected prior 
to cyanogen bromide cleavage. After 
cleavage by cyanogen bromide, cell ex- 
tracts were chromatographed on Seph- 
adex O-50 in 50 percent acetic acid (Fig. 
6c). In this system, somatostatin is well 
separated from excluded large peptides 
and fully included small molecules. Only 
extracts of clones positive for somatosta- 
tin exhibited radioimmune activity in tbe> 
column fractions, and this activity elutes 
in the same position as chemically syn- 
thesized somatostatin. 

The strains carrying the Eco RI lac op- 
eron fragment (such as pSOMn-2 and 
pSOMll-3) segregate with respect to the 
piasmid phenotype. For example, after 
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^ Iig 6. Radioimmune assay for somatostatin activity. The assay used is 
amodification of existing methods (2). tTyr"jSoinatostatin (a gift from 
)^ \V Vale) was iodinated by a chloramine T procedure (2). To assay for 
J Somatostatin, the sample, usually in 70 percent formic acid containing 
i- cy inogen bromide (5 mg/ml), was dried in a conical polypropylene tube 
if'^O 7 ml, Sarstedt) over moist KOH under vacuum. Then 20 ^il of PBSA 
f^vtufter [75 ttiM NaCi; 75 mM sodium phosphate (pU 7.2; bovine serum 
' itjbumin (1 Mg/ml): and sodium azide (0.2 mg^ml)] was added, followed 
by 40 fi\ of a ['"Ijsomatostatin mixture and 20 |Lti of a 1000-fold dilution 
' in PBSA of rabbit antiserum to somatostatin S39 (2) obtained from W.' 
,j Vale The ['"II somatostatin mixture contained (per milliliter of PBSA 
bufter) 2.^0 Atg of normal rabbit gamma globulin (Antibodies, Inc.), 1500 
un^ts of Trasylol (Calbiochem), and about 100,000 counts of [Tyr"- 
^' " !]somatostatin. After at least 16 hours at room temperature, 0.333 ml 
iof goat antibody to rabbit gamma globulin (Antibodies, Inc., F .03) 
>:in PBSA buffer was added to the sample tubes. The mixture was incu- 
bated for 2 hou rs at 37°C . c ooled to 5°C , then centrifuged at 1 0,000^ for 5 
itninutes, The supernatant was removed, and the radioactivity in the pel- 
&t was counted in a gamma counter. With the ajnount of antiserum used, 
iSO percent of the counts was precipitated with no unlabeled competing 
ssomatostatin. The background with infinite somatostatin (200 ng) was 
asually 3 percent. One-hatf maximum competition was obtained with 
jIO pg of somatostatin. <a) Competition by bacterial exti-acts. Strains 
t coli RRl (pSOMn-5) and E. coli RRl (pSOMl 1-4) were grown at 
37-0 to 5 X 10^ cell/ml in J. broth. Then IPTG was added to a concen- 
tration of I mM and growth continued for 2 hours. Portions (1 ml) were 
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centrifuged for a few seconds in an Eppendorf centrifuge, and ttie pel- 
lets were suspended in 500 /j.1 of 70 percent formic acid containing 
cyanogen bromide (5 mg/ml). After approximately 24 hours at room 
temperature, the samples were diluted tenfold in water, and the indica- 
ted volumes were assayed in triplicate for somatostatin. BIBg is the 
ratio of I'^^Hsomatostatin bound in the presence of sample to that 
bound in the absence of competing somatostatin. Each poim is the 
average of triplicate tubes. The protein content of the undiluted sam- 
ples were determined to be 2.2 mg/ml for E. coli RRl (pSOMH-5) 
and 1.5 mg/ml for coli RRl (pSOM-4). (b) The initial screening of 
clones for somatostatin. Cyanogen bromide-treated extracts of II 
clones (such as pS0M11-2 and pSOM11-3) were made as described 
above for (a). A sample (30 fii) of each extract was taken in triplicate 
for radioimmune assay. The range of assay points is indicated. The 
values for picograms of somatostatin were read from a standard cui ve 
obtained as part of the same experiment, (c) Gel filtration of cyanogen 
bromide-treated extracts. Formic acid and cyanogen-treated extracts 
of the positive clones (11-3, 11-5, li-6, and U-7) were pooled <lotal 
volume, 250 |Ul), dried, and resuspended in 0. 1 ml of 50 percent acetic 
acid. ['H]Leucine was added, and the sample was applied to a column 
(0.7 by 47 cm) of Sephadex G-50 in 50 percent acetic acid. Portions 
(50 fj.1) of the column fractions were assayed for somatostatin. Pooled 
negative clone extracts (11-2, 11-4, and 11-11) were treated identically: 
On the same column known somatostatin (Beckman Instruments, Inc.) 
elutes as indicated (i'S). 
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Table I. Somatostatin Tadioimmune specific activity. Abbreviations; LB, Luria broth, IPTG, 
isopropylthiogalactoside; CNBr, cyanogen bromide; SS, somatostatin. Protein was measured 
by the nnethod of Bradford (i2). 



Experiment 


Strain 


Medium 


IPTG 
1 mM 


CNBr 
5 mg/ml 


SS/protein 
(pg/mg) 


! 


11-2 


LB 


+ 


+ 


< 0.1 




U-3 


LB 


-t- 


+ 


12 




il-4 


LB 


+ 


+ 


< 0.4 




11-5 


LB 


+ 


+ 


15 


2 


11-3 


LB 


-1- 


+ 


12 




11-3 


LB 


+ 




< 0,1 


3 


11-3 


LB 


+ 


- 


61 




11-3 


LB 


- 


+ 






11-3 


LB 


+ 




< 0-1 


4 


11-3 


LB 


+ 


+ 


71 




11-3 


VB + glycerol* 


+ 


+ 


62 


5 


11-3 


LB -1- glycerol 


+ 


+ 


250 


■ 6 


11-3 


LB 


+ 


+ 


320 




11-2 


LB 


+ 


+ 


< O.i 


7 


11-3 


. LB 


-1- 


+ 


24 




1U3 


LB 




+ 


10 



•Voget-Bonner minimal medium plus glycerol. 



about i.5 generations, about one-half of 
the E. co/i RRl (pSOMn-3) culture was 
Qonstittitive for j3-galactosidase, that is, 
carried the lac operator, and about half 
of the noncenstitutive colonics were 
ampicHlm-Sensitive. Strains positive 
{pSj^MUr3) and negative (pSC)MIl-2) 
foI^f■ .soms^lostatit) are unstable, and, 
therefore j. Ihc growth disadvantage pre- 
sumably comes from the overproduction 
of the large but incomplete and inactive 
galactosidase. The yield of somatostatin 
has varied from 0,001 to ^).03 percent of 
the totnl cellular protein (Table 1) prob- 
ably as the result of ihe selection for cells 
in culture having plasmids with a deleted 
lac region. The highest yields of soma- 
tostatin have been from preparations 
where growth was started from a single 
Ap-resistant, constitutive colony. Even 
in these cases, 30 percent of the cells at 
harvest had deletions of the Sac region. 

Several moderate scale (up to 10 liters) 
attempts' have been made to purify so- 
matostatin from E. coli strain RRI 
(pSOMn-3). The initial purification 
scheme w;as based on known purification 
properties of /3-galactosidaise followed 
by p^irification of the cyanogen bromide 
cleavage products :of the chimeric pro- 
tein.' However, essentially all of the 
somaloslatin activity found in the cnide 
extract is insoluble and is found in the 
pellet from the first low speed centrifuga- 
lion. The activity can be solubilized in 70 
percent formic acid, 6M guanidinium hy- 
drochloride, 8/W urea; or 2 percent so- 
dium docecyl sulfate. Somatostatin ac- 
tivity has been enriched approximately 
100-foid from the cellular debris by cy- 
anogen bromide cleavage, and subse- 
quent alcohol extraction and chromatog- 
raphy oh Sephadex G-50 in 50 percent 
acetic acid. 
1062 



Recent improvements in the chemical 
synthesis of DNA provide the opportu- 
nity to synthesize quickly DNA with bio- 
logical interest for genetic manipulation 
and experimentation. As illustrated ear- 
tier {16, IJ), in vitro recombinant DNA 
techniques and molecular cloning en- 
hance the experimental value of chem- 
ically synthesized DNA, There arc two 
well-established methods for the syn- 
thesis of DNA. The phosphodtestcr 
method of Khorana and co-workers {}8] 
and the more recently developed modi- 
fied phosphotriesler method (7). Both 
methods are ciipable of producing func- 
. tional DNA {S6, 17, 19, 20); however, the 
triester method is probably faster. More- 
over, a method for rapidly synthesizing 
ti'imer blocks (codons) as bjilding units 
for longer oligodeoxyiibonucieotides 
(21) (I'ig. 2b) has increased the speed of 
the triester method. From the trimer 
block library, a hexadecadeoxyribonu- 
clcolidc now can be obtained in a week. 
We have established here that the DNA 
made with this improvement is function- 
al. 

The data establishing the synthesis of 
a polypeptide containing the somatosta- 
tin amino acid sequence are summarized 
as follows, (i) Somatostatin radioimmune 
activity is pi"esent in E. coli cells having 
the plasmid pSOMl 1-3, which contains a 
somatostatin gene of proven correct se- 
quence an3 has (he correct orientation of 
the lac Eco Rl DNA fragment. Cells 
with the related plasmid pSOMll-2, 
which has the same somatostatin gene 
but an opposite orientation of the lac 
Eco Rl fragment, produce no detectable 
somatostatin activity, (ii) As predicted 
by the design scheme, no detectable 
somatostatin radioimmune activity is ob- 
sensed until after cyanogen bromide 
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treatment of (he cell extract, (iii) The 
somatostatin activity is under conlrol of 
the lac operon as evidenced by indtJCtion 
by IPTG, an inducer of the lac operon. 
(iv) The somatostatin activity cochro- 
matographs with known somatostatin on 
Sephadex G-50. (v) The DNA sequence 
of the cloned somatostatin gene is cor- 
rect. If translation is out of phase, a pep- 
tide will be made which is different from 
somatostatin at every position. Radio- 
immune activity is detected indicating 
that a peptide closely related to soma- 
tostatin is made, and translation must be 
in phase. Since translation occurs in 
phase, the genetic code dictates that a 
peptide with the exact sequence of sorq^i 
tostatin is made, (vi) Partially piiritied 
samples have been independently as- 
sayed by W. Vaie (Salk institute). He 
has confirmed our radioimmune activity 
with both antiserum S39, which is direct- 
ed by the NH^-tcrminal, and with anti- 
senim S201 which interacts mainly with 
somatostatin positions 6 through 14. (vii) 
Finally, the above samples of E. coli 
RRI (pSOMIl-3) extract inhibit the re- 
lease of growth hormone from rat pitui- 
tary celts, whereas samples of E. coli 
RRI (p.SOMl l-2) prepared in parallel 
and with identical protein concentration 
have no effect on growth hormone [e^ 
lease {22). 

Our results represent the first success 
in achieving expression (that is, tran- 
scription into RNA and translation of 
that RNA into a protein of a designed 
amino acid sequence) of a gene of chem- 
ically synthesized origin. The large num- 
ber of plasmid molecules per cell results 
in a substantial amount (at least 3 per- 
cent) of the cellular protein as the fi-ga.- 
lactosidase-somatostatin hybrid. This 
molecule appears to be relatively resisl- 
ant to endogenous proteolytic activity. 
There, is evidence that abnormally short 
/3-galactosidase peptides are degraded in 
E. coli (14) suggesting that the hybrid 
protein molecule expected from the first 
somatostatin-lac plasmid (pSOMl) is al- 
so rapidly degraded. The synthesis of 
many gratuitous proteins in E. coh^ 
whether large enzymes or smaller poly- 
peptides, may be undetectable for thi!> 
reason. In cases where the amino add 
composition of the protein is appropn- 
ate, the precursor technique described 
here can be employed. This approacli 
could possibly be extended by taking ad- 
■ vantage of proteolytic enzymes with 
amino acid sequence specificity. 

The amount of somatostatin synthe- 
sized was variable and about a factoi of; 
10 less than the maximum predicted: 
yield. This variability could be inte' 
preted in several ways. Protein degrade- 
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iion by endogenous proteases, thfe inabil- 
ity to fully solubilize the chimeric (jro- 
teiii, and the selection of altered 
plasmids could all be contributing factors 
[o the variability in yield. Although re- 
cotribitiant DNA experiments with 
chemically synthesized DNA are inher- 
ently less hazardous than those with 
PNA from natural sources, consid- 
cialion should be given to the possible 
toxicity of the peptide product. A major 
: factor in the choice of somatostatin was 
Its proven low toxicity (3). In addition, 
the experiment was deliberately de- 
signed to have the cells produce not free 
somatostatin but rather a precursor, 
which would be expected to be relatively 
inactive. The cloning and growth of cell 
tiiltures were performed in a P-3 con- 
tainment facility. 
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Cytidine 3',5'-Monophosphate (Cyclic (CMP) 
Formation in Mammalian Tissues 

Abstract. Mammalian tissues possess the capacity to synthesize cytidine 3',5'- 
monophosphate (cyclic CMP) via the enzymatic conversion Mf cytidine 5'-triphos- 
phate to cyclic CMF by cytidylate cyclase. Cyclic CMP formation occurs best in the 
presence of manganese or iron, at neutral pH, at 3T^C, in the absence of detergents, 
and with whole tissue homogenate fractions. Thus, mammalian tissues are capable 
of synthesizing not only cyclic AMP and cyclic GMP, but also cyclic CMP. 



Adenosine 3 ',5 '-monophosphate (cy- 
clic AMP) and guanosine 3',5'-mono- 
phosphate (cyclic GMP) are purine cy- 
clic nucleotides that are generjtlly 
thought to influence or regulate numer- 
ous cell functions and biological events. 
In many instances, however, alterations 
in cell function cannot be accounted for 
by corresponding or concomitant altera- 
tions in the tissue concentrations of ei- 
ther of the two purine cyclic nucleotides. 
Therefore, the existence of other endog- 
enous regulatory molecules is constant- 
ly being sought. Cytidine 3',5'-niono- 
phosphate (cyclic CMP) was first identi- 
fied in cells (leukemia L-1210) by Bloch, 
who demonstrated also that the addi- 
tion of exogenous cyclic CMP to L- 
1210 cells in culture abolishes the char- 
acteristic temperature-dependent lag 
phase and stimulates the resumption of 
growth or proliferation of these leukemic 



cells {1, 2). These experimental findings 
suggest that cycUc CMP, a pyrimidine 
cyclic nucleotide, may play a biologic 
role in the control of proliferation of leu- 
kemic cells. 

Shortly after the discovery of the natu- 
ral Occurrence of cyclic CMP in certain 
leukemic cells, an enzymfe system ca- 
pable of forming cyclic CMP from its 
naturally occurring substrate was found 
in murine myeloid leukemic tumors and 
in normal mouse liver and spleen (.^). 
Thus, oiir experimental findings on the 
capacity of mammalian tissues to synthe- 
size cyclic CMP support those of Bloch 
on the identification of cyclic CMP in 
malignant cells. 

The properties and biologic impor- 
tance of cytidylate cyclase in normal 
and malignant mammalian tissues were 
recently reported briefly (4). At the same 
time Cailla and Delagge reported on the 
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abstract We have cloned double-stranded cDNA copies 
of a rat preproinsulin messenger RNA in Escbericbia coli xl776, 
using the unique Pst endonuclease site of plasmid pBR322 that 
lies in the region encoding amino acids 181-182 of penicillinase. 
Hiis site was reconstructed by inserting the cDNA with an oli- 
go(dC)^>ligo(dC) joining procedure. One of the clones expresses 
a fused protein bearing both insulin and penicillinase antigenic 
determinants. The DNA sequence of this plasmid shows that 
the insulin region is read in phase; a stretch of six glycine resi- 
dues connects the alanine at position 182 of penicillinase to the 
fourth amino acid, glutamine, of rat proinsulin. 

Can the structural information for the production of a higher 
cell protein be inserted into a plasmid in such a way as to be 
expressed in a transformed bacterium? To attack this problem, 
we used as a model rat insulin, an interesting protein that can 
be identified by immunological and biological means. 

Although mature insulin contains two chains, A and B, it is 
the product of a single longer polypeptide chain. The hormone 
is initially synthesized as a preproinsulin structure (1, 2). A 
hydrophobic leader sequence of 23 amino acids at the amino 
terminus of the nascent chain is cleaved off, presmnably as the 
polypeptide chain moves through the endoplasmic reticulum 
(2-4), producing a proinsulin molecule. The proinsulin chain 
folds up and then the C peptide is cleaved from its middle (5). 
Thus each of the two (nonallelic) insulin genes in the rat (6-8) 
encodes a polypeptide 109 amino acids long, whose initial 
structure is NH2 — leader sequence — B chain — C peptide — 
A chain. 

Ullrich et al. (9) have cloned double-stranded cDN A copies 
of rat preproinsulin mRNA isolated from pancreatic islets and 
determined sequences covering much of those two genes. We 
have made double-stranded cDNA copies of mRNA from a rat 
insulinoma (10) and cloned these in the Pst {Providenda 
stuartii endonuclease) site of pBR322 (11), which lies within 
the penicillinase gene. 

The Escherichia coli penicillinase is a periplasmic protein, 
the gene for which was recently sequenced (12). Penicillinase 
is synthesized as a preprotein with a 23 amino acid leader se- 
quence (12, 13), which presmnably serves as a signal to direct 
the secretion of the protein to the periplasmic space, and is re- 
moved as the protein traverses the membrane. Insertion of the 
structural information for insulin into the penicillinase gene 
should cause expression of the insulin sequence as a fusion 
product transported outside the cell. 

MATERIALS AND METHODS 

Bacterial Strains. E. coli K-12, strain HBlOl [hsm-, hrs', 
recA~, gal~, pro~, str^ (14)] was initially obtained from H. 
Boyer. E. coli K-12 strain xl776 (15) (F", tonA53, dapD8, 
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minAl, supE42, A4(^gal-uvrB], X", minB2, rfb-2, nalA25, 
oms-2, thyA57, metC65, oms-1, A29[fctoH-asd], cycB2, cycAl, 
hsdR2) was provided by R. Curtiss. 

DNA and Enzymes. pBR322 DNA, a gift from A. Poteete, 
was used to transform E. coli HBIOI. Plasmid DNA was puri- 
fied according to the procedure of Clewell (16). Avian myelo- 
blastosis virus reverse transcriptase (RNA-dependent DNA 
polymerase), E. coli DNA polymerase I, and terminal trans- 
ferase were gifts from T. Papas, M. Goldberg, and J. Wilson, 
respectively. Restriction enzymes were purchased from Be- 
thesda Research Labs and New England BioLabs. 

RNA Purification. An x-ray-induced, transplantable rat beta 
cell tumor (10) was used as source of preproinsulin mRNA. 
Tumor slices (20 g per preparation) were homogenized, and a 
cytoplasmic RNA (about 2 mg/g of tissue) was purified from 
a postnuclear supernatant by Mg2+ precipitation (17), followed 
by extraction with phenol and chloroform, and enriched for 
poly(A)-containing RNA by oligo(dT)-cellulose chromatogra- 
phy (18). About 4% of the material binds to the column (data 
from eight preparations). Further purification of the oli- 
go(dT)-cellulose-bound material by sucrose gradient centrif- 
ugation and/or polyacrylamide gel electrophoresis showed that 
the preproinsulin mRNA was a minor component of the 
preparation. 

Double-Stranded cDNA Synthesis. Oligo(dT)-cellulose- 
bound RNA was used directly as template for double-stranded 
cDNA synthesb (19), except that a specific p(dT)8dG-dC primer 
(Collaborative Research) was utilized for reverse transcription. 
The concentrations of RNA and primer were 7 mg/ml and 1 
mg/ml, respectively. All four [a-32p]dNTPs were at 1.25 mM 
(final specific activity 0.85 Ci/mmol). The reverse transcript 
was 2% of the input IINA, and 25% of it was finally recovered 
in the double-stranded DNA product. 

Construction of Hybrid DNA Molecules. pBR322 DNA (5.0 
Hg) was linearized with Pst, and approximately 15 dC residues 
were added per 3' end by terminal transferase at 15° in the 
presence of 1 mM Co2+ (20) and autoclaved gelatin at 100 
Aig/ml. Similarly, dC residues were added to 2.0 ng of dou- 
ble-stranded cDNA, which was then electrophoresed in a 6% 
polyacrylamide gel. Following autoradiography, molecules in 
the size range of 300 to 600 base pairs (0.5 ^tg) were eluted from 
the gel (21). Size selection was done after tailing rather than 
before because previous experience had indicated that occa- 
sionally impurities contaminating DNA extracted from gels 
inhibits terminal transferase. The eluted double-stranded cDNA 
was concentrated by ethanol precipitation, redissolved in 10 
mM Tris-HCl at pH 8, mixed with 4 fig of dC-tailed pBR322, 
and dialyzed versus 0.1 M NaCl/10 mM EDTA/10 mM Tris, 
pH 8. The mixture (4 nd) was then heated at 56° for 2 min, and 
annealing was performed at 42° for 2 hr. The hybrid DNA was 
used to transform E. coli xl776. 

Transformation and Identification of Clones. Transfor- 
mation of E. coli xl776 (an EK2 host) with pBR322 (an EK2 
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vector) was performed in a biological safety cabinet in a P3 
physical containment facility in compliance with NIH guide- 
lines for recombinant DN A research published in the Federal 
Register [(1976) 41, 27902-27943]. 

xl776 was transformed by a transfection procedure (22) 
adapted to xl776 by A. Bothwell (personal communication) 
and slightly modified as follows: xl776 was grown in L broth 
(23) supplemented with diaminopimelic acid at 10 ^g/ml and 
thymidine (Sigma) at 40 /tg/ml to OD590 of 0.5. Cells (200 ml) 
were sedimented at 500 X g and resuspended by swirling in 
1/lOth vol of cold buffer containing 70 mM MnCla, 40 mM Na 
acetate at pH 5.6, 30 mM CaCl2, and kept on ice for 20 min. 
The cells were repelleted and resuspended in Vaoth of the 
original volume in the same buffer. Two milliliters of the an- 
nealed DNA preparation was added to the cells. Aliquots of this 
mixture (0.3 ml) were placed in sterile tubes and incubated on 
ice for 60 min. The cells were then placed at 37° for 2 min. 
Broth was added to each tube (0.7 ml) and the tubes were in- 
cubated at 37° for 15 min; 200 nl of the cells was spread on 
sterile nitrocellulose filters (Millipore) overlaying agar plates 
containing tetracycline at 15 tig/ ml. (The filters were boiled 
to remove detergents before use.) The plates were incubated 
at 37° for 48 hr. Replicas of the filters were made by a proce- 
dure developed by D. Hanahan (personal communication): The 
nitrocellulose filters containing the transformants were removed 
from the agar and placed on a layer of sterile Whatman filter 
paper. A new sterile filter was placed, on top of the filter con- 
taiiung the colonies and pressure was applied with a sterile 
velvet cloth and a rephca block. A sterile needle was used to 
key the filters. The second filter was placed on a new agar plate 
and incubated at 37° for 48 hr. The colonies on the first filter 
were screened by the Grunstein-Hogness technique (24), using 
as probe an 80-nucleotide-long fragment produced by Hae III 
digestion of high specific activity cDNA (9). Positive colonies 
were rescreened by hybrid-arrested translation (25) as described 
in the legend of Table 1. 

Radioimmunoassays. Two-site soUd-phase radioimmuno- 
assays were performed (28). Cells from colonies to be tested 
were transferred with an applicator stick onto 1.5% agarose 
containing 30 mM Tris-HCl, pH 8, lysozyme at 0.5 mg/ml, and 
10 mM EDTA; released antigen was adsorbed to an IgG-coated 
polyvinyl disk during a 1-hr incubation at 4° . The wash buffer 
contained streptomycin sulfate at 300 fig/m\ and normal guinea 
pig serum (Grand Island Biological Co.) instead of normal 
rabbit serum. Guinea pig antiserum to bovine insulin was 
purchased from Miles Laboratories. 

Standard (liquid) radioimmunoassays were performed using 
the back titration procedure employing alcohol precipitation 
of insulin-antibody complexes (29). 

DNA Sequencing. DNA sequencing was performed as de- 
scribed by Maxam and Gilbert (30). 

RESULTS 

Construction and Identification of cDNA Clones. We 
isolated poly(A)-containing RNA from a transplantable rat 
insulinoma. This preparation contained preproinsulin mRNA, 
because it directed the synthesis in a cell-free system of a 
product precipitable with anti-insulin antibody (data not 
shown). However, the mRNA yield after further purification 
was not sufficient for cloning, and therefore we decided to ck>ne 
cDNA synthesized from the total preparation. In an attempt 
to enrich the reverse transcript for insulin sequences, we utilized 
the DNA sequence reported by Ullrich et al. (9) to choose a 
specific primer, (dT)8dG-dC. The product of double-stranded 
cDNA synthesis (19) was extended by a short oligo(dC) tail 
about 15 nucleotides in length, and sized on a polyacrylamide 



Table 1. Hybrid-arrested translation and immunoprecipitation 
of the cell-free products 

Radioactivity, cpm/20 ii\ 

Immuno- % 
Source of Acid precipitable Immuno- 

arresting DNA insoluble — Insulin + Insulin precipitable* 



Control I 










(-DNA, -RNA)t 


2,570 








Control II 










(-DNA, +RNA)J 


35,700 


12,300 


310 


36.2 


pBR322 


28,800 


7,850 


245 


29.0 


Clone 3 


15,100 


3,630 


264 


26.9 


Clone 13 


19,600 


5,190 


350 


28.4 


Clone 15 


18,600 


4,850 


252 


28.7 


Clone 16 


29,200 


8,830 


247 


32.2 


Clone 17 


24,000 


6,700 


316 


30.0 


Clone 18 


15,900 


3,690 


251 


25.8 


Clone 19 


8,650 


587 


277 


5.0 


Clone 20 


15,100 


4,070 


231 


30.6 


Clone 21 


21,100 


5,170 


223 


26.7 



Plasmid DNA (about 3 /ig) was digested with Pst, precipitated with 
ethanol, and dissolved directly in 20 ii\ of deionized formamide. After 
heating for one minute at 95° each sample was placed on ice. Following 
the addition of 1.5 ng of oligo(dT) -cellulose-bound RNA, pipera- 
zine-iV,N'-bis(2-ethane8ulfonic acid) (Pipes) at pH 6.4 to 10 mM, and 
NaCl to 0.4 M, the mixtures were incubated for 2 hr at 50°. They were 
then diluted by the addition of 75 ii\ of H2O and ethanol precipitated 
in the presence of 10 jig of wheat germ tRNA, washed with 70% (vol/ 
vol) ethanol, dissolved in H2O, and added to a wheat germ cell-free 
translation mixture (26) containing 10 /iCi of PH]leucine (60 Ci/ 
mmol). Fifty-microliter reaction mixtures were incubated at 23° for 
3 hr and then duplicate 2-ii\ aliquots were removed for trichloroacetic 
acid precipitation. From the remainder two 20-/tl aliquots were treated 
with ribonuclease, diluted with immunoassay buffer, and analyzed 
for the synthesis of immUnoreactive preproinsulin by means of a 
double antibody immunoprecipitation (27) in the absence or presence 
of 10 ng of bovine insulin. The washed immunoprecipitates were 
dissolved in 1 ml of NCS (Amersham) and assayed in 10 nl of Omni- 
fluor (New England Nuclear) by liquid scintillation counting. 
* Calculated using the formula [(immunoprecipitable radioactivity 
in the absence of insulin) — (immunoprecipitable radioactivity in 
the presence of insulin)]/[(acid-insoluble radioactivity) — (acid- 
insoluble radioactivity of control I)]. 
^ Reaction mixture incubated in the absence of added RNA. 
' Cell-free translation by the direct addition of oligo(dT) -cellulose- 
bound RNA into the reaction mixture. 

gel. A broad size cut averaging 500 base pairs was selected in 
order to enrich for full-length sequences. We inserted these 
molecules i^to the Pst site of pBR322 after elongating the 3'- 
terminal extension of the cleavage site with oligo(dG). We used 
this oligo(dG)'oligo(dC) joining procedure in order to recon- 
struct the Pst recognition sequence (ref. 31; W. Rowenkamp 
and R. Firtel, personal communication); approximately 40% 
of the inserts were excisaUe with Pst after cloning. From about 
0.25 Mg of tailed cDNA we obtained 2355 transformants in E. 
coli strain xl776. To identify clones containing insulin se- 
quences, we first screened one-third of the transfcHrmants, using 
as a probe an 80-nucleotide-long Hae III fragment of cDNA 
synthesized from oligo(dT)-bound RNA because the results of 
Ullrich et al. (9) suggested that such a fragment should be in- 
sulin specific. About 20% of the clones were positive, but re- 
striction analysis of plasmid DNA from a few candidates 
showed that the inserts were not insulin sequences. We con- 
cluded that our probe was not pure and rescreened some of the 
positive clones, using hybrid-arrested translation (25). This 
method is based on the principle that mRNA in the form of an 
RNA-DNA hybrid does not direct cell-free protein synthesis. 
We incubated aliquots of oligo(dT)-bound RNA with linearized 
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Fig. 1. Restriction map of the insertion in clone pI19. Each restriction site is identifled by a number indicating the 5'-terminal nucleotide 
generated by cleavage at the message strand. Nucleotides are numbered beginning with the first base of the sequence encoding proinsulin. Nu- 
cleotides in the 5' direction from position 1 in the message strand are identified by negative numbers, beginning with —1. Arrows indicate the 
sequenced fragments; those pointing to the left indicate sequences derived from the antimessage strand, and those pointing to the right indicate 
sequences derived from the message strand. The uniquely labeled restriction fragments were generated as follows: Following excision with Pst, 
DNA of the insertion was digested with Aua II and end labeled. Fragments A and C purified from a polyacrylamide gel were sequenced directly 
because the Pst ends do not label significantly. Fragment B was strand separated on a polyacrylamide gel and sequenced in both directions. 
The exact number of C-G pairs in the right-hand tail before the Pst site could not be counted. 



DNA from nine clones under conditions favoring DNA-RNA 
hybridization (32), added them to cell-free translation systems, 
and assayed for a specific inhibition of insulin synthesis. Table 
1 shows that one of the plasmids, pI19, inhibited the synthesis 
of immunoprecipitable material. Restriction endonuclease 
digestions of the Ps< -excised insert of pI19 with several enzymes 
generated fragments whose sizes were consistent with the se- 
quence of Ullrich et al. (9). We confirmed the presence of in- 
sulin DNA in pi 19 by direct DNA sequence analysis and 
screened the rest of the clones with purified pI19 insert labeled 
by nick translation. About 2.5% (48/ 1745) of the clones hybrid- 
ized strongly to this probe. There must have been enrichment 
for insulin sequence at some step of our procedure, because 
hybridization analysis using cloned insulin DNA as probe 
showed the presence of only 0.3% insulin mRNA in the original 
oligo(dT)-bound RNA. 

Sequence Information. Fig. 1 shows the restriction map of 
the insertion in clone pi 19 and Fig. 2 shows the sequence of the 
insert. It corresponds to rat insulin I (5, 33) and encodes the 
entire preproinsulin chain with the exception of the first two 
amino acid residues of the reported preregion (1). It therefore 
extends the sequence determined by Ullrich et al. (9) by 
twenty-five 5'-terminal nucleotides. It also verifies the reported 
amino acid residues for positions —14, —17, —18, and —20; it 
identifies the previously imcertain residue —15; and it identifies 
the unknown residue —19. However, the residues at positions 
—16 and —21 differ from those reported (1). 

The sequence deviates from that determined by Ullrich et 
al. (9) at the region immediately after the UGA terminator, 
where a GAGTC sequence occurs, predicting a Hfnf cleavage 



site that we have experimentally verified. Furthermore, only 
moderate agreement exists between the two sequences for the 
next 15 nucleotides of the 3' untranslated region. 

Eiqpression. Almost two-thirds of the clones carrying inserts 
were ampicillin resistant; thus the active site of penicillinase 
must lie between amino acid residues 23 and 182 (12). The 
degree of resistance was variable, suggesting the expression of 
different sequences from the inserts in the form of fused 
translation products, probably differing in length and stabili- 
ty- 

We therefore screened colonies of the 48 clones containing 
insulin sequence for the presence of insulin antigenic deter- 
minants, using a solid-phase radioimmunoassay (28). Polyvinyl 
sheets coated with antibody molecules will bind specific anti- 
gens released from bacteria. The inunobilized antigen can then 
be detected by autoradiography following exposure of the sheets 
to ^^I-labeled antibody. This method permits detection of as 
little as 10 pg of insulin in a colony. We coated plastic disks with 
anti-insulin antibody and used '^I-labeled anti-insulin to detect 
solely insuhn antigenic determinants. Disks coated with anti- 
penicilhnase antibody and exposed to ^^I-anti-insulin detect 
the presence of a fused protein, as do disks coated with anti- 
insulin and exposed to radioiodinated anti-penicillinase. 

One clone, pI47, gave positive responses with all of the 
combinations described above; this indicates the presence of 
a penicillinase-insuhn hybrid polypeptide. Fig. 3 shows some 
of the results. To determine whether this fused protein is se- 
creted, we grew clone pI47 in liquid culture and extracted the 
proteins in the periplasmic space by osmotic shock, a method 
that does not lyse bacteria (34). Fig. 4 shows that the insulin 



TrpMetArgPhe Le-uiPrc LeuLeuA laLeuLeul'alLeuTrfJluProLysProA la J I nA I a 

(c)tgcaggggggggggggggggtggatgcgcttcctgcccctgctggccctgctcgtcctctgggagcccaagcctgcccaggct 
(g)acgtc cccccccccccccccaccta cgcg aaggacggggacga ccgg gacgagcaggagaccctcgggttcggacgggtccga -1 

Pst Hhal Haelll 

PheValLysGlnHisLeuCysGlijProHisLeuValGluAlaLeuT-jrLeuValCysGl'jGluArgGlyPhePheT-jrThr'ProLysSer'A rgA rgGlu 

tttgtcaaacagcacctttgtggtcctcacctggtggaggctctgtacctggtgtgtggggaacgtggtttcttctacacacccaagtcccgtcgtgaa 

1 AAACAGTTTGTCGTGGAAACA CCAGG AGTGGACCACCTCCGAGACATGGACCACACACCCCTTGCACCAAAGAAGATGTGTGGGTTCAGGGCAGCACTT 99 
Avail MboII 

34 

ValGluAspProGlnValProGlnLeuGluLeuGlyGlyGliiProGluAlaGlgAspLeuGlnThrLeuAlaLeuGluValALaArgGlnLysAr^Glg 
GTGGAGGACCCGCAAGTGCCACAACTGGAGCTGGGTGGAGGCCCGGAGGCCGGGGATCTTCAGACCTTGGCACTGGAGGTTGCCCGGCAGAAGCGTGGC 
100 CACCTCCTXGGCGTTCACGGTGTTGACCTCGACCCACCTCCGGGCCTCCGGCCCC TAGAAG TCTGGAACCGTGACCTCCAACGGGCCGTCTTCGCACCG 198 
*"^11 «lul Haelll Haelll MboII Hpall 

Hpall Hpall 

e? S6± 

IleValABpGlnCysCysThrSerlleCysSet'LeuTyrGLnLeuGluAsnTyrCysAsn 

ATTGTGGATCAGTGCTGCACCAGCATCTGCTCCCTCTACCAACTGGAGAACTACTGCAACTGAGTCCACCACTCCCCGCCCCC 

199 TAACACCTAGTCACGACGTGGTCGTAGACGAGGGAGATGGTTGACCTCTTGATGACGTTGA CTCAG GTGGTGAGGGGCGGGGG 

Hinf 

Fig. 2. DNA sequence of the insertion in clone pI19. Nucleotides are numbered using the convention described in Fig. 1. Accordingly, amino 
acids are numbered beginning with the first amino acid of proinsulin, while the last amino acid of the leader sequence (pre region) is numbered 
as -1. Restriction endonuclease cleavage sites experimentally verified are underlined and identified. The arrows indicate, in order, the ends 
of the leader sequence and the peptides B, C, and A. Two nucleotides indicated by double underlining are uncertain. 
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Fig. 3. Initial detection of penicillinase-insulin hybrid poly- 
peptides in an insulin cDNA clone. Cells from colonies of the 48 insulin 
cDNA clones and from control colonies, xl776 and xl776-pBR322, 
were appUed to an agarose/lysozyme/BDTA plate. Positive controls, 
5 ng of insulin and 5 ng of penicillinase, each in 1 nl of wash buffer, 
also were spotted on plate. Antigen was adsorbed to an IgG-coated 
polyvinyl disk during a 1-hr incubation at 4°. Immobilized antigen 
was labeled by setting the plastic disk on a solution containing ra- 
dioiodinated anti-insulin IgG. The autoradiographs are of disks 
precoated with anti-insulin IgG (A) or anti-pencillinase IgG (B), ex- 
posed on Kodak X-Omat R film using a Du Pont Cronex Lightning 
Plus intensifying screen for 12 hr at -70°. The arrows indicate the 
signal generated by clone pI47. The large exposed area in the lower 
right of (A) is the positive control for insulin detection. 

antigen was recovered in the distilled water wash of the shock 
procedure. Table 2 shows that the insulin antigen in the wash 
is also detectable and quantifiable by a standard radioinunu- 
noassay. The yield of antigen depended on the growth medium; 
antigen was released by cells grown in M9/ gliKX»e/amjno acids 
medium but not by cells grown in brain/heart infusion. We 
estimate a recovery of about 100 molecules per cell. 

Structore of the Fused Protein. We sequenced pI47 to de- 
termine the sequence around the jimctions. Fig. 5 shows that 
a proinsulin I cDNA lies in the Pst site in the correct orientation 
and in i^iase, so that a fused {M'otein can be syntheazed. In pI19, 
the insert is in the correct orientation, but not in phase. In pI47 
the ol^o(dG)«ligo(dC) region encodes six glycines that connect 
the penicillinase sequence, ending at amino acid 182 (alanine), 
to the fourth amino acid (glutamine), of the proinsulin se- 
quence. The cDN A sequence in pI47 extends 26 base pairs past 
the UG A terminator. Thus, we infer the structure of the fused 
protein to be pemcillinase(24-182)-(Gly)6-proinsulin(4-86). 

DISCUSSION 

The coding regions of eukaryotic structural genes are often 
interrupted by introns (35-38), whose transcripts are sphced 
out of the mature mRNA. Because prokaryotes do not appear 
to process their messei^ers, double-stranded cDNA made f rem 
a mature messenger is the material of choice to carry eukaryotic 
structural information into bacteria. 

By using cDNA cloning technology and an extremely sen- 
sitive method to assay expression, we were able to construct a 
derivative of E. coli strain xl776 carrying an insulin gene se- 
quence and to detect the synthesis and secretion into the peri- 
plasmic space of a fused protein carrying antigenic determi- 
nants of both insulin and penicillinase. This was accomplished 
simply by inserting double-stranded cDNA carrying the 
structural information for insulin into a restriction site within 
the structural gene for penicillinase. Not only is the fused DNA 
sequence expressed as a chain of amino acids, but also the 
polypeptide folds so as to reveal insulin antigenic shapes. Thus 
we expect soon to be able to demonstrate biological function 
for this, or for a similar, fused protein. 

We anticipate that the joining of cDNA sequences to nu- 
cleotides that lie ahead of the Pst site in the penicillinase gene 



Table 2. Immunoreactive insulin concentration in distilled water 
wash of osmotic shock procediu^ 





Insulin, 




Exp. 


Munits/ml 


Cells/ml 


1 


318 


1.5 X 101" 


2 


166 


6.0 X 10* 


3 


386 


4.2 X 101" 



Duplicate 0.1-ml aliquots of each sample prepared as described in 
the legend to Fig. 4 were assayed (29) in a final volume of 0.4 ml using 
rat insulin standard, a gift from J. SchlichtkrulL One unit = 4&itg. The 
NaCl/Tris wash, the 20% sucrose wash, and the media of xl776-pI47 
as well as the water wash from osmotic shock of xl776-pBR322 gave 
values below tiie sensitivity of the assay (25 ftunits/ml). 

will also produce fused and secreted molecules. Moreover, if 
the fusion replaces the preproinsulin leader with that of peni- 
cillinase it is likely that the new protein will also be secreted by 
the E. coli cell and may even be correctly matiued by cleavage 
of the leader sequence. 

dearly, we have ex|^ited a graiaral method that should lead 
to the expression and secretion of any eukaryotic protein pro- 
vided another protein, such as penicillinase, wiU serve as a 
carrier, by virtue of its leader sequence. Moreover, the secretic» 
of the eukaryotic protein sequence to the periplasm or extra- 
cellular space will both permit its harvest in a purified form and 
probably eliminate intracellular sources of instabihty. 

Often just an expression <rf antigens is the goal In a "sho^un" 
screenii^ the existence of a fused protdn antigen could be used 
to identify transformants carrying desired eukaryotic gene 
fragments. On the other hand, the insertion a DNA fragment 
coding for surface antigenic determinants of a virus into a 
carrier protein should lead to the secretion of a fused protein 
that could serve as a vaccine, even though no entirely correct 
vims product is ever produced. 




Fig. 4. Release of insulin antigen from xl776-pI47 cells by os- 
motic shock. One liter of xl776-pI47 cells growing at 37° in M9 me- 
dium supplemented with 1 g of tryptone, 0.5 g of yeast extract, and 
0.5% glucose was harvested at a density of 5 X 10'' cells per ml and 
washed two times in 10 ml of cold 10 mM Tris-HCl, pH 8/30 mM 
NaCl. The cells were then osmotically shocked (34) in the following 
manner The final wash pellet was resuspended in 10 ml of 20% sucrose 
per 30 mM Tris-HCl, pH 8, at room temperature, made 1 mM in 
EDTA, shaken at room temperature for 10 min, centrifuged out, re- 
suspended in 10 ml of cold distilled water, shaken in an ice bath for 
10 min, and again pelleted. The resulting supernatant was termed the 
"water wash." As a control, 1 liter of xl776-pBR322 was grown and 
treated in a similar manner. Aliquots (1 mU of each fraction to be as- 
sayed for the presence of insulin antigen were applied to the surface 
of a 1.5% agar plate. (A) Positions of each fraction on the plate. M, 
medium; IW, first wash supernatant; 2W, second wash supernatant; 
S/T, sucrose/Ttis supernatant; OS, distilled water wash; I, insuUn. 
(B) Autoradiograph showing results of a two-site radioimmunoassay 
of these fractions. Antigen was adsorbed to a polyvinyl disk and la- 
beled by using anti-insuUn IgG. The labeled areas correspond to the 
water washes and the positive control (1 ng insulin). A spectropho- 
tometric assay for |3-galactosidase (23) indicated that no more than 
4% of cells lyse during this procedure. 
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182 4 
AspThrThTUetPToAlaGlyGlyGlyGlyalyGlyGlnHisLeuCysGlyProHisLeuValGluAlaLeuTyr' 
GACACCACGATGCCTGCAGGGGGGGGGGGGGGGGGACAGCACCTTTGTGGTCCTCACCTGGTGGAGGCTCTGTAC 
CTGTGGTGCTACG GACGTC CCCCCCCCCCCCCCCCTGTCGTGGAAACA CCAGG AGTGGACCACCTCCGAGACATG 
Pst ^ Avail 



86 183 
LeuGluAsnTyrCysAsn A laMetA laThrThrLeu 

CTGGAGAACTACTGCAACfTGAlGTCCACCACTCCCCGCCCACCCCTCTCCCCCTGCAGCAATGGCAACAACGTTG 
GACCTCTTGATGACGTTGA CTCAGG TGGTGAGGGGCGGGTGGGGAGAGGG GGACGTC GTTACCGTTGTTGCAAC 

Hinf o Pst 



Fig. 5. Partial DNA sequence of the insertion in clone pI47. Clone pI47 DNA was digested with Hiraf and two fragments, HI and H2 (ca. 1700 
and 280 base pairs long, respectively) were isolated. HI contains the amino-terminal portion of the penicillinase gene and the bulk of the cDNA 
insert. HI was digested with Ava II, end labeled, and digested again with Pst. A fragment 39 nucleotides long (fragment A, arrow) was isolated 
and sequenced. Fragment H2 was end labeled and digested with Alu I (which cuts at the region corresponding to amino acid 200 of penicillinEise). 
A fragment 88 base pairs long (fragment B, arrow) was isolated and sequenced. The termination sequence TGA is boxed. 
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Synthesis of an ovalbumin-like protein by 
Escherichia coli K12 ,„ 
harbouring a recombinant plasmid 

Oane Mercerea-Wialoo, Roy-, Bn^^^, Ax.. GarapiB, 
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A cio«ea DNA tran^ript of ovalbumin mRNA m^cuia 

fSnuckoMes away from the initiator codon, arui fused m 

iZt t^UnriiU of the Eschenchia coU ^g./^- 
phase to rnj s ^^^^ ^^^^^ g 

protein. — ^ 

duaion and cloning, in &c^.er^fc/a co^ W2^^^^^^^ ^.^ 
Origin. Howevejs the -pr«^^^^ 

bacterial eovirontnenl IS stiupoony craisa, 
isolated from the '"''<'^,«"''"7°^X''i,e7usly ^Tpressed in 

E. coi . There t\as Deen uu "^"^ = tran-xribed or trans- 
higher eukaryotes that ^P^^'^^ 

toted. Transcription can "^^^^f '"^^^^^ phage gene". It 
tion of gene sequences w.thm ^ ^ac^nf « ^^^^^ a poly- 
seems that the best way to P'""""^^ '""S 
peptide product i.by the ~ Lteria! gene. 

This approach has oeen"»c"'" ;„ the form of a 
ofpXdngthe_smaUh^^^^^ 

hybrid protein ■ EeiiecOTin6 fused to the end of 

aLnoaddslong. wassynth^«ed» i„ , 

'^'^''^' f'^^J:^^^^-^-^ ,he somatostatin 
amn,o aeids) ^'^f^^^^^^ ^yu^. as somatostatin contains 
moiety through a '"/^X lattot labile in certain conditions. 

itro"d'brers^^ 

treatment with cyanogen ^oroide (.io„i„e residues, so 

' 88880£22T2:i OX 9Z6V 866 £19 



..er deavame protein ^^^^^J^^^, 
possible applications °\'l'<'''^'^^l'^^^^f^lli fen somatostatin, 
^ly on the synthesis "J P^^i^P^be ch^ ^vn*-^'-'^ 
coded by gene sequenc^ ^J^l.ltv thtt at least some eukaryotic 
Pre-nt. The -cent ^^^^^^^^^^^ « leas^^ . , a 
genes are split mam u »" ' expression in E. coU. 
from the genome, are ea i!> ^"^^f''^'^^ to involve (1) enzy- 

cloning m B. ^"jj- „„,,.^s to the chicKen ovalbumin gene 
We have applied this f ^ „^io„ bacteria! strains 
sequence andwc ' ^^''^.^f^'^f ""^vSSumhi-Uke protein, 
which symhesise large .^^i^^ S^t^approach and 

S^nrbr^rpr^st 

The ovalbumin system 

i:ibnm.nisthemaioreg.white^^^^^^^^^^^ 

chain of 3S6 amino ^' '^^^ 
bondandundergoesafewpos.«^^^^^^^^^ 

oviduct, which facilitates its .solation in '^^"^^^^ 
mRNA is 1.859 nucleotides lonj l^^^^^^^ ^, , 

been determined Jjf ^""^^/^^n. followed by a 1.158- 
64-nucleotide long 5 "f,-^?'^^"„^J.^„onlcoding region D- 
longcodingregion anda6» 
Theovalbumm gene IS spilt m me ^g^ous 

have recently isolated. 

genomicDNAfragmen«wbichcod^ revealed the 

„KNA'-^ Anays. 0 heclo^^^^^^^ ,„ locajed 

presence of at least ^« j,k[S as the structure of the 

within translated regions of ov mRNA. ^ 

® M»«a>Ula» Jouriwls U4 1^ 

iSIDI IXSIO Hd eS^f'T lOOZ 6Z NVf 
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Hi. 1 Map of ovalbumin mRN A. Schematic repretentetion of ov 

(T) and non-translatrf (NT) regions, the AUG start signal 
(nucleotides 65-47) for protein synthesis, »nd the i 

36.31 

Hhal, M and S«I sites in the corresponding DNA sequenca 

ovalbumin gene «as likely to prevent its expression in E. co/i, we 
turned ourfrtention towards the DNA transcnpt of o^f RNA^ 
Humphries el al?', using a protocol devisai foi the <^^'^^\ 
globinfflRNA transcripts»-^°, first '«''f*«'',^P'fr 
0V2-1 , whtch carries about 1.730 base paitBof DNA t^ns°*ed 
from ov mRNA. Evidence has been presented that a faithful 
cCed DNA copy is obtained^ As ov mRNA is 1 859 nudeo- 
tides bna, pCRI ov2-l does iwt harbour aU the ov mRNA 
sequence Restriction mapping, DNA sequencing and elecUon 
SfcBJSCQpy analysis" "-^'^'. however, indi«.te that the missmg 
parts originate from untranslated regions of the mff^A 

As pCRIov2-l carries ati the sequence codingfor o^f^^i 
we first aslced whether the gene is expressed m C600r.nu 
foCRIov2-ll. Transcription of the ovalbumin sequence from 
^Lct strand could be demonstrated. However, the strain 
produced no ovalbumin-like protein, as l'«''«««''.^5L'^^!°"": 
munoassays (see below) which were negative even in wnd'hoM 
where less than one molecule per cell would have been de ected 
(OM.-P., unpublished). We tentatively conduded hat the 
ovalbumin sequence in pORIovM has been aj^^-dentally fused 
an unknown plasmid gene, probably out of the proper reading 
frame. 

Constraction of recombinant /ac-ov plasmlds 

We then decided to fuse the ovalbumin gene sequence to the 
Sgin^ng^the E. coU lac Z gene, which codes for ^-galacto- 
Xe Fuller*^ has generated a small DNA fragment of 205 
pairs, flanked by two £coRI sites, which cames, m addition 
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to the L8UV5 lac operator promoter region, the information for 
the arst eight amino adds of /3-galactosidase. (The L8UV3 
mutaUon makes the system.inscnsitive to catabolite repression.) 
This fragment was engineered by Charnay et aL into a 
AcI857p;ac5ilZL8UV5, the DNA of which has a single Ecom 
site located after the eighth amino acid of ^f-galactosidase m iSk 
lac sequence. Wc used this DNA to generate by digestion wuh 
EcoRI and HmdlH a DNA fragmenl of 2.85 kiiobases which 
carries the beginning of the Uxc s^^^^^f ' .■'T''f J^fS^f"* 
recombiued in vim with pBR322 DNA" digested by the same 
two enzymes, as outlined in Fig. 2, and doned in E. coll. Hie 
rcsultingplasmid, called pOMPO. was the vector u^d for further 
recombination experiments. pOMPO is about 7.1 kiiobases m 
size It makes the recipient host cell resistant to ampiciUm (but 
not to tetracycline) and toe constitutive (because lac repressor is 
overlitrated by the operator carried by the muKicopy plasmid), 
and has a single EcoRI site located near the heginnmg of the fac 

^Uxt^of ^L ovalbumin sequence carried by P^^^'f 
pCRIov2-l can be isolated in a restriction fragment of 2,430 
base pairs, moov. obtained by digestion of plasmid DNA with 
H/ifll'"' The analysis of cloned genomic parts of the oval- 
bumin gene" has demonstrated the existence, near the 5 end, of 
a she sensitive to Hhal, located very close (about 30 basc^) 
to an S«I site in a region present in pCRIov2-l fref . 3 ). When 
the nucleotide sequence of this re^on of ov mRNA estabhsbed 
by McReyaoIds et af.» became available, it seemed that there 
was no Hhal site (C3CGC) in the sequency. It occurred to 
however, that 35 nudeotides ahead of the &fl site dof 
5' end (nudeotides 116-120), there was a sequence, GOVOCA 
(nudeotide, 77-82). which could be Ranged to GGCGCA 
without modifying the amino acid sequence (GGU and UtrC 
both code for glydne). The assumption could be "tade that 
McReynolds et al}' and Humphries et al'' had worked on 
different geneUc variants and that the Hhalj\t^ 
,observcd would be located at nudeotides 78-Bl of ov mRNA, 
overlapping two triplets, GGC and GCA, which code or the 
fifth S rfxth amino adds of ovalbumin^ /'°"'t ^the v^v 
fragment extracted from pCRIov2-t should be cut at the very 
be^noing of the mRNA sequence coding tor ammo acids. 
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^ construction of plasmlds pOMPO PO"^^" ^^^^^^if^^^^^^ 
205-basepairfacseq«ence(dcs<.ibedtaref.32).n.o.^^^^ 

Intramolecular recombinationsutequartly yieMed^^^^^^^ -^^ lac sequence .s deputed ami 

sequence as shown m 500 ng of *d8S7pj«5L8UV54Z DNA i*^^^^^ ^y „ DNA ligasc in a Snal vdume of 15 ,.1 as usu»l _ 

MngofpBR322 DNA (shown asi)wered.ees.rfby _Ec»Wand« transfurmants were selected on tryptone plat« 

Aliquols were used to transform CaC3,-treated stmn 478 f f'^^fj^.^f 4^ ^ „r'). on which strains carrying the recombu^nt 
Alining ampiciUin (20 .gnir')^d 5^ro^^^^^^^ 

plasmid give bhK cdonies. Punfted pOMPOwas "Sfl^, restriction analysis using the mdicated sites {see Fig. 4y pOMPO « 

Fig. 3). ^e map of one representative ^^^^'^■fyT;;^tf^%a Ml e.periments\.=re carried out with ultrapun. hyperhydor w-^r 
atout 7.1 kUobases long, has smftle sites tor ficoRl, H.ndlH, 
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Bg. 3 Fusion of the ovaltumin 
sequence lo the beginning of the Ian Z 
gene. The DNA sequence of live 
BroRl-restricted lac Z gene fragment 
is shown in I wit* <he corresponding 
amiivo acid sequence". The first eight 
amino adds of ovalbumin and^their 
corresponding DNA seqnenc* are 
shown in II, with one diange at posi- 
tion 79 (CiG instead of T:A wtitten 
above the line) which creates arfte toi 
HhiA restriction endoouelease . The 
latter spUlx tlic DNA as indicated. 
Elimination of the single-stranded 
ends and blunt-end ligation produce 
the fused gene sequence shown m III, 
with the corresponding amino acid 
sequence. Single-stranded ends were 
elitninated in the case of pOtvTPO by 
diceslion with Si nuclease followed by 
a treatment with DNA polymerase I 
(to repair possible mistakes made by 
S,). Hham protruding ends were 
digested cither by Si nuclease or St 
nuclease followed by DNA poly- 
merase 1. or by DNA polymerase 1 
alone making use of the 3' 
activity of this c 
fbrlOminat65 
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ir by DNA polymerase 1 , _ 

IB use of the 3' exonuclase c„DT/,v.f il'<Uor4flminat37°C.ThercactionwasstOPpedbyheati^ 

act,vttyo.,n-»en^me. 1 M^of purifie^'pOMPODN^^^ 

fhr 1 a min at65 "C The mixture was adittSted to 3 mM ZnSOi. JU mM • .r fte DNA was dialysed against 10 mMTns-Ha 

^iJded After 15 minat37°C,lhe reaction was stopped bS' heating for 
^ .f tmMEDTMstepI). Partof thisprepar«ion(330 

VJm for 15 min at 15 X and then heated for 10 min at 65 C l»'«P ">J?^5('4r temrinated by extraction with a mixture 0 

Worth ngton. heated for 5 min at 80 °C before use) for 15 mm at « "^Sfe-jKf p^^^^ 5 mM EDTA (step m\ Hhaov (te . 28 

& an'd cWorofonn, and the DNA was f ^'^Jj! ^^^^J^^^e ItLdS 5^^ 

fr^eni was treated either with: a.Si exonudeasc «SfJ«5'^ *"= *^«^ deiribedfor pOMP0)or, c with S, exonnclease DNA 
nfTnNA Dolvmerase I (280 ng of HTicov were treated with 0.2 »"* "1='^^^'^^^ 'jase 0 20 ng of pOMPO DNA at steps I, H or III 
"p^lyiiSrafcr/lXgofth^DNipre^^^^^ 

u^s Heated wiA 7 nu of Hfcaov treated either a« m a, 6 or c m a hnat volume ™ . j ; transform Caaj-treated E. coli 1398 (ref. 35). 

U.hefoUowingcombinat,ons:pOMPLW»dn-^e|Wj^^^ 

DNA fragtttents of ^^y^l^tllT^'^^f.^ LT^ 



Sequetuing of pCRIov2-l DNA" later deitionstrated that these 
assumptions were correct (Fig. 3). 
mwayinwhichtheovalb,ittiinseqnence.nmaovwasfused 

in phase tc the beginning of the Z gene «^^P'«^f J" ^'^^^ 
hriet alter elimination of the single-stranded ends of the Hhaoji 

S nTlcase and/or DNA polymerase I, blunt-end ligation with 
ex^lTof T4 DNA ligaS^shotdd yield the strttctMe shown tn 
H« sin those strains where integration of the Hhmv fraginent 
00 utJ-^in the proper orientation^ lite hybrtd gene isthen 
expected to produce an ovaibumin-Uke protein of 389 mnino 
TctK which the first five a«ino oi ov— a^e 
substituted by the ftret eight amino aads of p-galactosidase 
fincludinE the first Met residue). 

ara^Snt molecules were introdnced by tra^f^««''«" 
into £ co« 1398 (ref. 35), a lysogenic host adequate tor the m 
Colony hybrkatio" method which v.c have reynt^y 
de^isS^Lpicillin-reststantcoloilieswere then screened^ 

'^^.Selled hLov. a probe which '^""'^ ^j^f^^^^j 
plasmids harbouring an ovalbum.n Jl^^^t^^^u^ 
eight independent strains showmg ^5°°?;'"""'? ^ 
colonies on X-gal plates, see legend to Rg. 3). The isoiateo 
is (pOMPL 2. 4. 5, 7.9^ 10. 11) were then transferred 
into a non-lysogenic host {C6(X)rKmi). 

Characterisation of the recomhinaat ptesmids 

The orientation of the integrated ovalbumin ^^Aue"*^ 

iTtet^ined as folios: r^^-^^'^'^'^^^llfXt Z 
two sites, one in the amp.allm gene f^PBR322 fef. 

88880£22T2;T OX 9Z6P 866 ^£19 



trfasmids do fail into these two expected classes, in ptasm.as 
St 2 and 4. the ovalbumin sequence is in the proper 
Senmt on to be transcribed from the lac promoter, whereas m 
nSS^OMPS, 7, 10 and 11. it U in the oppostte direction. 

T?eter«hfe wither some of Bk^c straina Prod«« an 
ovalbumin-like protein, we used a radioimmunoassay, m whch 
SdS — in mixed to b-ferial extrac^nd *en 

aUowed to react with nd S^o S Se 

details of the method are described in the legend to Fig. S The 
S were perfectly correlated with the structure of the 
^rfnna^t plasmids: plasmids pOMPl, pOMP2 and pOMP4 
S^sTve^ slong competitton, whereas the others findudrng 
^oS Sd not. PlasmW pOMP2 was arbritrarily selected for 
further studies. 

Productioa of an ovalbninln-Uke 
protein by pOMP2 

Further radioimmunoassays were f^^^''^ '° ^''^ZX 
amount of ovalbumhl-like product made m E. cok In Hg. ^, 
^r^tition bv pure, unlabelled ovalbumin (Ftg. 5i) is 
competitKm P^^jl^ anB. coli strain carrying 

3??f1 T^cSS by the latter is only 80% 
Sf^M^ts tha^one (or sev^al) anti^^^^^ 

rffit^^t^ of^l^ve ovalbumin. Because the si.e of the 



Bbarcl Assigned 



SIO Ud LOOZ 6Z NVf 



Nalme Vol. 275 12 October 1978 



SOS 

ovalbumin-like product is approximately «hat expected (Pig. 6 
and below), we favour the second mterpreiation. As we not^ 
!bovt °vi'bum!n. in th. oviduct, undergoes ^^-^ '"^^^l 

ctructure Of antifieniu determinants. Ak it is unlikely that sucn 
SSions tSe place it, E. coU. the ^'^^f>-«^l^^^"^^^ 
(which in any case differs from native ovalbumin b/^ few ™ 
acids in the N-tertninal etrf ) may be. for some of the anUgen c 
ste, a less efficient competitor " "^''''^^f tf 

phasic curve indicates that 2-5 xiu ceiis ^ anma 
equivalent of about 15 ng of ovalbumin. «hat' :«^"t 9^000 
molecules of ovalbumin-like protein per cell. If «hf«h°le curve 
was considered as monopha.,ic the estimate would b« of 4S 000 
nwkcules per ceU (5.000-4S.O00 in a vanety of independent 
expeiitnents, 30,000 on average^ 



Bacterial eslracl </iI) 
0.1 I IM 




F.S. 4 Restriction anatjW of ptemids canning the ovalbumin 

tone) were run at .5 Vcm ' as m ret. 40, Plasraid "NAs *ere 
p»ld U cleared lysates by l*"lf^°" r^/^ 
predpitalion, subjected to digestion fay ftil f^^f 
rilSrf •'^ and run in parallel w>th size flan^^S and 10. 

"d857S«»7 DNA digested by SroRI and BamH! lane 
Ad857S7 DNA diu^ted by EcdRl 43), l^^^*^^^^^ 
nNA digested by ffi/idllT (rcf. 34 . The sac of fragments was 
fs^maS " a s^^ifoKaritbrnic plot relating dtetanoe to (in 
kUobaK pairs). Plasm.ds pOMPl. 2 and 4 panes l- J/""/' 
resrctivcly) io* t*.o fragments of estimated size 6/ jo-^ " 

rMt>eaively) show two fragments of about 8.6-S and 0.7 kitobasra. 
Ptoitd P0MP9 (lane 7) does not yield the expected t"gmcnB. 
"pSU(ex«plforpOMW).anthusbees,,ma.^to 

range over 9.35-9-8 kilobases. The theoretical s=e « 9.63 k o 
ha,^ (using esftatcs of 4.35 kilobases for pBR322^. ^^^l 
tees for^«ov; 2.85 kUobases for the Ato ^^f "'f .^f; 
ment (as detcrtnined by electron microscopy). TT« respeclM 
orientation of the lac and ovalbumin sequences wh^h «» be 
deduced from .hesedatawasconfirraedinthe case otpOM^^ 
variety of othere,(perin»ntsincludingdoubted.g«tionbyM^^^ 
Hindlll. which yields three fragments of about 3.45. ^1 

Mobas^s. and digestion byM (<^'»/°»^, "^Vftefstf 
W plasmid pOMP2 digested by iliwllU (ret. 




OvaRminin (ng) 



Fm 5 Deiection of ovalbumin-like protein in bacterial eKtracis. 
oTaL-Sn li'Sa. 99% pure) (30 ..g in 20 pJ) «^bbcn«i 
with ImCi o£ Bolrnn and Hunter "fB*^" J/^/™^ 

wiferatoil^ ribbiu. by injection of 1 nig of ovalhumm 
by preparative gel electrophoresis) •=»"'«f,^^ 
comoletc adjuvant. A second in ection was made 3 weeks later 
Td b« were made 2 weeks after. Antibodk^ were punfed 

by <NH.)^, precipitation and '^^S'^l'^^^'lZ^T^l 
JllulosB and Biogel PVl colamns". Radioimmunoassays were 
S rirorin a r2l volume of 300 t^l in P^A (ph-Ph^Jl"'?; 
n 1 iTrMT s NariO ISM.boyineserumalbumms mgml to 
°dilfe"dk™ l^eria. extract (or of -'''-"'f,;^- 
^n)in PBSA.were added 20 ^1 of mature ^-^'-'^J^^^'f 
,„™sp«ific rabbit antiserum as a <^'''''' J^^J^J^^tf^^ 
Sed ovalbumin. This was followed by 
dilution of the spedik anHbodteS able P««f "t.^'^atlT- 
iodinated orotein. After an 18-h racubation at ''>^»""°??5'„ 
btnd ICin was separated from free °valbmm by add*on 
^ N, <;n 18% (w/v) final The precipitate was centrifuged for . 
fmtif ^ lempIraTure in a microfuge; supematants were 
pi^ttMaXuets and supernataniseounted in agamma »unter 
rSad e^riments with todinated antibodies demonstrated tha^ 
SSb"NaaSO.wBsessentiallycom^et.Wa„on^ 

fodinVled ovalbumin in the ^^"^°l'^ff^^^i,^^C) 
5-8% of input- Strain 1442 (BMH nsh fltproiac/ 
FrA^18wi4"nsformedbyp01vin.amlgrownonglucc^^ 

^i^toTm plates (to prevent loss of *^^^Xr™i^ 

overproduction of lac repressor). Uquid cultuTO oi i-^^ 

feOMW) and C600r.m:(pOMP2) i6 ^ bro'h «h 5«mb 

Snnicillin were inoculated from plates at about 1 0 cells per ml and 

IXoaCuxlO'baeteriaperml^ 

So (5 or 10 m^S). In some experimems cultures w«e ^ 

frozen and sonicated i« Ae prese„« of protease lnlubit«, PTSF^ 
p-toluyl-sulphonyl-fiuoride, Ega-Chemie, ^Owml )- " me 

Txperiment shown in bacteria ^"Xtlhe^d a^ 
onCA mniaininB PTSF frozen, somcatcd (2 X 20 s in tne coioj uiu 
d^uttilSX ' *e^e buirir mixed with a «ns.ant amount erf 
c tiVt^Ste^ent strain e^ 
out in duplicate. B/Bo is the ratio between the c.p.m. P««P>^ « 
the absence of ba^erial extract or unlabel 

ovalbumin (nonspedfically P'^<#'«tf J:?-""^ '^I'^t'fJri^out 
/! sitracts of 1442 (pOMP2) w«h (A) and without 
T^eS^ to ten S^kulaied as ^ of nonconcen^ 
S^STxIS (M- 1.2><10' bacterial b. Standard ctirve witii 
^:^u«^.a. -^-/o pure^ 50% competition corresponds to 
15 ng of pure ovalbuitun. 
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Ro 6 Size of the ovalbumin-like product in immunopredpitated 
Kriaf extracts. 5-ml cultures of l*.«<POMPO\Oane^ and^ 
and i442(pOMP2) (lanes 3-6) were inoculated at 37 C m 63B1 
Xcerol medium supplemented with S^.gmI ' amp.eaim and 
fventualirOanes 5 aSd 6) with 1 mM IFTG. grown to an A»o of 
0 5 labelled with "S-methionine (Amersham. fOM-Cml , 
800 Ci mMor') for 30 min poured over an equal vdumeo^ 
^,^hed frozen solution containing 10 mM NaNj. 200M.gml 
crushea -.niiomr^ PTSF The cells were then 

chloramphenicol and SOp-gml ^ 
SntrifuMd (S.OOOg, 10 min at 4°C) and frozen at -20 C. The 
^tt wfs resxSended on 0.5 ml of 10 mM Na phosphate buffer, 
^ 6 rand sScd (2X20 s in ice). Immunoprecipitates were 
Spa;d^d~edbUhoads«.^^^^ 

To lo' tichloroacetic acid-precipitable c.p.m. were added e.ther 

?ni of unTabelled carrier ovalbumin and 50 of bovne serm^ 

,u • m^A^ flanes 3 and 5), or 50 M-g of competmg unlabeUed 
albumm(BSAKlanes J anoD; m-b ^^^^ 

ovalbumm and 5^^^^^^^^^ ^^J^ deoxyeholate. 

foS^ NlV'o^pta^tL. pH 7.5, and 0^5 MNaCl to which 
on 1 of a nonspecific rabbit serum were added. The mixture 
?210 J> was tncuSed for 30 min at room temperature, layered 
^ onnTl nf 1 M sucrose, 1% Triton X-100. 1% deoxycholate, 
tS N^pholpt";r. pH 7.5, and 0.15 NaCl. and centn- 
uoS5min1uXinamicrofuge).Theupperph^ewascar^^^^^^ 
^ ..Ih V„H 70 uJ of rabbit anti-ovalbumin antibodies, enough to 
precipitate '-^1^8 ^ descnbcd 

r^r^ tte Som of the tube and the immunoprecipitate was 
from ™ " 1 f glycerol, 2% sodium dodecy 

resuspended f^^^^^l pH 6^^^ 0.7 M jS-mercaptoethanol 

itmtVen'ofbS^^^^^ 

^i„m?XXtehaepolyacrylamidegel.Theresolvinggetwas 

nut wfith the apparatus described by Studier at u, lor 
U h at s" fter the'^run, the gel was fixed and stamed wi h 
SomasJie brilliant blue R-250 (0.2% (Wv) m 
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' "'5- Vir 

^4,^ j-*-"! 
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(0.2% (w/v) m , J . ^ ft„nrnwaDhed with flash-activated Fuji RX filuK 

^ - irnnreenated with diphenyloxazole. dried and ""°™F"P^*° ,Xned size markers added to each 

methanol:water:acettc -^>f;. ^0 P^q/^. Arrows are 

1442(pOMP0); lanes 3 and 4, W42(pOMP2) withouj^ a^^^^^^^^^ ^^^^^^^.^ ^^^^^ 3 competitor. 



We cotild not clearly demonstrate, in C600 (pOMP2), that he 
svntheS of the ovalbumin-like product is under the con^ol of 
th^ SL^^^ (which is. at least Pf^^^'S^^TS^,''^^:, 
onerators) Because of this, we introduced pOMPl into a sttain 
K?) which overproduces lac repressor. Synthes^ was then 
s^mtilated by the lac inducer IPTG about 50-fold (F.g. 5a). 

Tht sle of the ovalbumin-tike product was ™t.«d as 
foS^extractsof cells harbouring POMP2 and labelledw«.«« 

S " s-methionine were precipitated by ant.-ovalbu^^^^ 
bodies and the dissociated precipitates were loaded on poly 
Samfde gels in denaturing conditions as explained m the 
kSnd to Fife The fluorograms in Fig. 6 show a predominant 
Snf conlfponding to a protein which ^^f^-^^^t 
or migrates slightly more rapidly th^^^^f.^'f^rigf .f^^^^ 
electrophoresis conditions. In strain ^"^^ ^^^^^'^^^^^^ 
tW< nrotein depends on the addition of IPTG. Addition ot 
2 rS^ed^ovalbumin in the — |SSn" 
eliminates most of the radioactive which mdi^ es ttat tt 
co^esDonds to an ovalbumin-like protein. In total label ea 
TxS no subjected to immunoprecipitation, the band could 
s^mettaes be directly observed. In summary, the ovalbumm- 
like product has a size close to that of ovalbumm. 

Synthesis of foreign polypeptides by E. coli 

We have constructed E. coli strains capable of syn*es«smg aii 

radioimmunoassay; (3) the size oi me prouuv-v j 
that of ovalbumin. 



Tneether these data strongly suggest that our E. strains 
Together, ^nesc u ,:te nroduct, the composition of which 

synthes.se an '^'"f/^f f that is, ^ght amino adds 

is probably very dose tothat^pecteM^^^^^^^^ 
of^-galactosidasean«81anHno acid^^ 
386 of native ovalbumm). i ne exai-i 
protein remains investigated ^ ^^^^.^^ 
Bacteria harbounng the re«)m^^^^^^ 45.000 or 90,000 

=ule:^^rri (de^Sng on t£ in— 
radioirnmu^^^^^^^ 

about 0.5-1 A ot ttie loidi P'" ,i,„„,ptical vield which can be 
figures are relatively ^^'^^^^^Tb^S^oSL monomers 
estimated (from ^^"""^Xfa muSpy plasmid with a 

! fr/nerfectlv stable and show no tendency to lose the 
strmns are f ^'^"f/'^^^^ ovalbumin is a secreted protem; 
recombinant plasmtd. S«=ond. wmo ovalbumin-like 

in preliminary ^'^P^"™^"^; fcOMPl) cultures. Third, 

product in the supernatant of (pUMr ^ ^^^^^^y 

L ovalbumin-Uke f ^^^tlufn^^r eTs^ meaLreme^ 
stable in thebactenal enwo™^^^ 

hasyetbeenmaJ;I^o«^,Ae^ons ^.^^^ 
not randomly distributed ana m ^^^^ 

codons(AUC,UUU,v-i-/^.'-'-^. . . known 
frequentl^in ovalbumin than^n^^^^^^^ 
sequence^*', such as MS2 or #Xl/4. ltwpo 

E coli in the correspondmg minor tRNAs aoes n 
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interfere much with expression of the ovalbumin gene 
sequence. 

This is, to our knowledge, the first example of synthesis by E. 
coU cells, of a long polypeptide chain from a higher eukaryote. 
The accuracy of the genetic construction which we made is not 
perfect, as five amino acids of the natural protein are missing and 
replaced by a few others from a bacterial protein. There is no 
theoretical reason, in our opinion, that the site of the junction 
between the lac and the ovalbumin sequences could not be 
moved down to the ATG initiator codon, leading to the 
synthesis of an accurate product. This is also the first example of 
a long polypeptide chain synthesised in good yield from a cloned 
complementary DNA made from mRNA. As a whole, our 
experiments demonstrate the feasibility of engineering bacteria 
for a potentially very large number of uses. Because of its well 
balanced amino acid content, ovalbumin is often used as a 
standard for nutritional value, and it is therefore conceivable 
that the ovalbumin-like product made by bacteria could be used 
for feeding purposes. . 

Biohazards associated with the experiments described m this 
publication have been examined previously by the French 
National Control Committee. The experiments were carried out 
accordingly in LIB 1 conditions''. 

We thank Drs P. Charnay and P. Tioilais for supplying the 
DNA of phage AcI857p/ac5AZL8UV5, Dr J. Rolland for help 
in the radioimmunoassays including the gift of rabbit antiserum, 
Mrs A Hemard for her help in the preparation of the manu- 
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Nucleotide sequence homology at 12 intron-exon 
junctions in the chick ovalbumin gene 
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A short partial sequence homology is present at all intron- 
exon junctions, or splice points, in the chick ovalbumin 
gene ; it is probably a signal for a splicing enzyme. The 
significance of the junction sequences for splicing is dis- 
cussed. We find no evidence of strong Watson-Crick base 
pairing between adjacent junctions. 



Our knowledge of the organisation of the ovalbumin gene has 
advanced rapidly because of the successful cloning of a major 
part of this gene'-^ Analysis of the clones' has shown that the 
gene is arranged in at least eight separate regions (I-VIII, Fig. 
1), which are colinear with the mRNA sequence*. But these 
expressed pieces or exons' are separated by seven sections of 



DNA or introns labelled A-G which are absent in the mature 
mRNA, Given this remarkably complex organisation, how is m 
^NA synthesised. Potentially an RNA polymerase co^ 
transcribe only the exons leaving the introns alone .-^^^^ 
'jumping polymerase' model"^ would essentially join up sections 
l-mi to give the correct mature mRNA. Alternatively the 
RNA polymerase could synthesise a^py of the entire gen 
copying both intron and exon regions. This pre-mRN A could 
cuf and the exons joined in the correct order, the -ntrons bemg 
eliminated. The latter hypothesis seems more likely as there > 
evidence of a 40S pre-mRNA' containing both exon and intro^ 

sequences. This mechanism also o<^J^^'^'^ ^^^I'^^'^Si 
such as those present in adenovirus^"", the -giobm ana 

The splicing must be a precise process. It has been suggeste 
that two adjacent exons distant in an immunoglobulin A cna 
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,ir -7;.-> <;„hseauentlv inverted repeat pair 5 of the newly 

, ON A S SSd fold back on hseU forming a new 
synthcaised DN A strana coum loi polymerase 
template for '""^er DNA sy".hp«^ ^^^^^^ 'Thrnewly 

•^""[f *S S i." it ^sLtverted repeS pair 3 and 
synthesised .^f^""^ " ff^j jj^ point the same slippage 
reaches P^'""" ^ ^f. J^^^^t S 
pcoccss ,s JL'^S a°U the newly made DNA 

repeat pair 3 and DNA byntnesis dKn.s j slippage 

Znd feading » J^. ^^^^ 'ScSlJ^fh^S 
tends to form long ^^f^^ synthesis can then pro- 

resist partial ™« f « ^jf^^d 

SSili H.. l\ - yj^j^^^f heteroduplex 
On another round of "Xl IS2 sequence 

molecule forms two ^^.f P"'^"'^' 

and the newly dnnm replication of 1S2 

TT,e slippage of th« D^^^^^^ 

and serand pa.« of E^^™ formed sequences generated by 
the A-T rtch region. The newy icnne ^ ^ 

^"-r rshX by iT^TmlS ^rts of the 
:eTu:r ■cottShe'U-cn si.nal is not yet .aown. 
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The cepUcation 'Slippage' model de^^m F^. 7 forIS2-6 

Hi:::" HSLn^ri:^^^^^^^^ assistance and 
We Aank Gudrun ""^"^ manuscrint This worlc was 
Patricia Nevers for reading the manuscnpi. imj 
suS^rted by the Deutsche Forschungsgememschatt. 

Rccdvcd 8 May; < 
F. «■ W. I., ^.-l^-^^^^^ BtaL 10. 305-309 (1976> 



Phenotypic expression m £. coJi of a 

DNA sequence codmg tot 
mouse dVoioIate reductase 

rCCSTt . SchuBklt & Slaide, N. Cohen tl 

s"isrs»-iSSo""«« — — 



meifll>ofe <ir«g trimethoprim. 



Ol/£0 -d 



Sn.CE the imtial P^t'on of eukanr^^^ 

several systems ^ave ^^.^^ f^Mgher orgasms. 

Escherichia col< at WA^nyto ^^^^^^ cukaryotes, 

logical activity °^ craj5«*. has b^n 

S^haromycescerewrae ^^^^'TT!^^ {^r functions that 
demonstrated '-'"gP^^lSi^ed homologous bacterial 
complement ^f^f^'^^^i, antibody made agam^ 

S^KaS^f^aP^^^^^^ 

-~~Z 14 u, addressed at Room S337, 

Stanford Umversity Medical Center, aiw 

88880£Z2I2I 01 9261^ 866 £19 



.^oence^.Veiyrc^tlMP-i-^^^^^^^^ 

proinsulin ,^„^.^!^ qNA (cDNA) transcript of 

Sonble^trandedcom^e^^D^^^ 
pre-prdnsnlm«RNA.mtha n^^^ 

for both msuUn '^^^^^^^^ned outside the celUtB not 
detected onafusedpeph^trans^ components 
toown, however, whrf«r the mam^ i^K 
of such immunologjcatly reactive «y 
functional "iolo^^^.f tnammalian gene expression in 
Our apFoach to the """^ °' 'Tlterogcneous population of 
baci^ria has been ^^"^^n.nce tS'^es fofa selec^bl^ 
clones carrying a DNA jeqne^ ^j^^jy thc»e 

mammalian gene P'°f"=^„TJ phenotypiciUy express the 
bacteria in the P<'Pit "amlua7^S:)™e dihydrofolate 
„e,ic -^Sv^fch^TheconCsionofdm^^ 
reductase (0HFR), especially suitable tor this 

folic acid to tettahydrofohc ^ hasTmuch lower affinity for 
purpose. The manunalian .^^^ ™s ^oes the cc^- 
L&=tabolicdmg,t«nftopnm2^ 

'Xf SSafffi aSr^^e resistant to levels of 
XL^Uia. ordinarily Host 
When th«e stBdieswe^m.^^^^^^^ ^ 

K3n°:n76 [:S^?rStrain^^^ already resistant to 
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high concentiaUons of Irimefhopr'H because of its thy' muta- 
tioa" direct selection of bacteria that synthesae the mam- 
roaliMi DHFR could not be carried out. Tlierelore, our mitial 
Studies used an in situ hybridisation procedure to identity 
tl776 clones which carried a ©puse DWR cDNA sequence, 
with the expectation that a highly sensitive indirect radiotm- 
rounoassay" would be used to detect any clones that caressed 
antigenic determinants of the protein producl encoded by the 
gene. 

Construction and doning of dhimaerk 
plasmids contaiiung a DHFR ds cDNA 

FiKure 1 summarises the experimental scheme used in this study. 
Partially purified mRNA containing DHFR sequence from 
roethotrexate-resistant AT-30D0 mouse cells'' was used m 
preparation of double-strand (ds) cDNA by RNA-dependent 
DNA polymerase (reverse transcriptase) and DNA polyinerase 
I (tef 14 and Fig. 2). Homopolymeric deoxy-C 'tails were 
added 10 the unfractionated cDNA by terminal deoxpuclco- 
tidy! transferase (Rg. 2) and the section of the gel containing the 
prcdomioanl (1,500 base pairs) tailed cDNA was duted. TTie 
material recovered from the gel was annealed with an equimolar 
concentration of pBR322 plasmid DNA that had been cleaved 
in the S-lactamase gene by the Pstl cndonudease and treated 
with terminal transferase to add honiopolyilicnc dG tails at the 
cleavage sites. The extent of annealing of DHFR ds cDNA w,^^ 
the plasmid vector was monitored by electron miCTOscopy 
using circle formation as an infator. PstJ sites are regenerated 
at both ends of the insert as a result of such re- 
circularisation"'" •Kn^nt 

Constructed chimacric piasmids were introduced into MM^.- 
treated" E. colt K12 strain ;ifl776 and tetracycline frc)-resis- 
tam transtotmants (yield -30 colonies per ng C>NA) were 
'elected and tested separately for the presenceof a DNAspe^e 
clplementary to a highly purified DIffR cDNA pt^be^M 

40% of the Tc-resistant clones gave a positive reaction by an m 
hybndisatioatcsf CTable 1 andF.g. 3).Ptem.dsfrom 14o 
the reacting clones were e=camined by gel electrophoresis and aU 
were found to contain a single DNA insert approximatdy UOO 

base pairs in length. As both ends of the -Pf J^^^^^il^^tes 
DNA receive homopolymeric dG (ails, only those molecules 
Sat hive acquired TdE-tailed cDNA insert or had esca^d 
cleavage and/or dG tailing would be expected to ^=-«^<^"'^^^ 
and trSnstorra; thus, wc presume that most of the nonreactmg 
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Rg. 1 Scheme used for doning and expression of """"f 
s^uences that code for DHFR. Aciditional Mperimental detwb 
are given in the legeBd to Table 1 and m the text. 

Tc-resistant colonies contain inserts of contaminating non- 
DHFRcDNA. ci^t 
When V2282. a ihy* variant of ^-1776, was approved for EK2 
use direct selection of bacteria tha, phenotyp.cally expre^ed 
the eukaryotic DNA sequence became feasible (see Fig. 1). I wo 
populations (25 colonies each) of previously f ""^fl Jc- 
Stant colonies of pooled, and plasmid DNA 

extracted from these populations was ■"tf'><»"f ^^^^ '^^"^ 
fotmaUon into ^-2282, In other expenmwte. 
;f2282d.rectlywithannealedpBR322-DHFRcDNA(Tablel^ 

Colonies that expressed resistance to both Tc and Ip were 
o^taTned in both types of experiments; three independent 
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TaM* 1 Transformation experiments usi ng xi776 and x2262 



TiansformanK per ng DNA 



Transforming DNA 
Plasmid vector Insert 

pBR322 Mon= 
pBR322 eDHA(r) 
pBR322 cDNAd") 
pBR322 cDNA{2 ) 

pDHFRT 



Tc 

(Sugnil ') 

60 
70 
60 
75 



Tc 

(10 (ig ml'') 

2x lO' 
32 



Tc/Tp 
(5 |ig mi" ot each) 

<2X 10"' 



In situ 
hybridisation with 
DHFR cDNA probe 
(% positive) 



40 
44 



1.3x10" 
25 



X1776 
X1776 
x2282 
X2282 

X2282 - 

■.. jr-, ii .inHBR rDNAfdesiEnaied 1°) was introduced into xl776 or x2282, u6iiiB a 
pBR322plasmidDNAlhat had been anneali^d inuj.ro «.U.dC-tailedD^^^^^ ,00 m! cf L brort. 

n.Siifica,io?of a previously *-*ed tr^s^^tion e ■ 1 ^^IZw^y^in. (4 « mP'^ Bacterial c«ltur« ""Xurel^ 

sutmleraemed with diaminopraieUc acid (DAP. 50 ml ) Jo,hedin 0 3 volume 10 mM NaO, resuspcnrtcd m 30 ml fresniy 

cXSTphase at 3S'C and then collected by cenortugatroii at 4^^^ 

Z^ZUCH b«t?er(70mMM^nz;40rnMsodiumaceta1e pHS_6 and30m^^ ^ ^ ^ g^.,.^ 50 „M o. MC> 

S^ded inl ml MCN and added in 200-fJ ^«!"°'' *°f « "jl^fj^ ^^l^SmtalhiW again L 30 mm. and SO-,J samples were P'-t^* °" » 
u1«Zi.f<,r rhilline ar 0° C for 30 mm, reactions were incubated at ^ ' <- whpn v->282 was used, the selective oKdmm 

SSrn\7agarVSenM with 0.5% casamino a^^^^^^ 

kanamydn (Km) as indicated. Plates containing ^ r existed of plasmid DNA isolated froin a nonfract«r«»c 

DNA iSdng the cDNA insert was used <B a conttoL '^^'^^I'^^^'j^^^^^^cWMt^^.ii. In the experiment shown in the last hae 

^olationoldonesthathadprcviourfj^^^ 

the tnmsforming DNA was isolated from a done UjXJMi-k n mn v 
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Pip Preparation and ch!,racteri£4itk)« ot DHFR cDNA luscr) 
1 hnI commentary to DHFR mRNA was sywhesised ««n- 

SmW <ev^' transaiptase and polysomsl KNA obmned by 

"oTnle— aL-.«i.tant AT-3(K)0 S^^^^^^ 
RNA had been esli.natcri to conlain DHFR mRNA •^J^Jtf 

DM A "5 ThB reaction was carrkd out m lOOitl using MUiig 
S.K'ln.ai S:VM[ma.edto?on,ain 5 «polyA-RNA), « uni^ 

"P riCTP (Amerstiam). Approximately 1.4 |ig cUtSA was 

Uput. f.^ the ^"^^if,^^^4":t^,^^:^f^ 

polymerase I «s<:>««ny a* °«<="'^, - „ u) units DNA 

d7«rfor 10 mm in 100 n "s-ng l-l ligcDNA, LUumtsLWA 
Ifvlieta" I a^d %0 of each deoxynudeotidc triphosphate 
S^ rdCTP adiit/d t^^^ C. mmol- as above. Approx.m.ldy 
0 S5 i?ot thil^nd strand was syn.hesised. The te^Con was 
I Irf extracted as above betore being passed over a 
f:^^lrG.57r li^n^n in TEN ^^^'^^ 
FDTA Column fractions containing the rts cPNA wmc tncn 

product, *. fi«t . „,^^, ofier S, nuclease t 
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9.9 
S.7 



2.3 
1.9 
1.77 



1.17 



D.S3 
045 



Imla' ° fi?sfst,a«l product after base .rcatn-ent; c, ^^o"'^^^^^^. on a U% agarose gel in alkaline om^X^ ^ 

described»^Tern<mal addition ^ -^f^^^a^'"^"^^^^^ f he reaction was carried out .n 500 ^J'"'."'^^^^^ 1 mM CoCl, 

carried out by a i„od.» 
base,U0mMKOW 



■itol. 150 tJ^dCTP (adjusted to 8 Ci mmol '^^r^Tn ' ppr«^^ 

(added to P'^*"'"'t'^TlTi-!Z^'.i „e7mnand0.5 ,J TdT(2.3 x lO' ""'^J"' >'^f 't*"^" '^^^ y iraiinus. The reaction was 



(added to piewarmcd reaction mix "^i" "" In V ;iTdT (2 3x10' units ml '). The reaction was dnv~-- ~ i- -7-- ^t., n^setion was 

stopped, extracted. ''"^^'''^-^"^ ^'f^^^^^^^ ^"^^f "^TTorofSfndelL^^^^ 

and 100 mM NaCl ^^^'^^'^'Zt^"]-^^^^^^^^ Biolabs) and .he taear P'^^""! 

agarose gel '"Tff ../?^^!'^,t„cadded^r3'terTni.i^ dG-tailed vector DNA were 

Approximately 'S--!" r^^" "^J^^ „^ coUecied. Equimoiar ""'"""'^ "".^^.'X.f o?7hatlte 

column in annealing buffer and the voKs vmun^ ^ p„t^lfoeT!amalcoininunicatip,n)excepttliatitievecw ^^^^ Thkannealed 



,ed. Equimoiar amounts of dC_.a. »l^ c^^^^^^^^ 

column in annealing I'lcrtbldby W." Rowe andR. A. Fiate! (P«™»^'=°»'""™""f6">^d'^ 'twA^ abo"< ' ™^ '"'"^'^'^ 

allowed to anneal ^'^<=»"""y=^ °i,arkation was monitored by electron mcoscopy and was typicaiiy 

the annealing reaction, ^^f^^^^ tratifonmUoo into ^1776 or Ar22B2. 



75 ng ml" 



transformantswercfoundtobe capable oftmme^^^^^ 

Tp resistance phsnotypes. However 'P , 
out for Tc resistance alone. I ranbier o| i^- ^ ,i.«™„h that all 

^^^e^l^LtteSnt encodeci by p.astnid. introduced 
into E. coll by transfoiTnation . 

structure ol the DHFR cDNA 

(rom three !>eparan,iy ucn-v showed simi ar overall 

hioh levels of trimethoprim resistance 
l:ma>^ Using such data, a cleavage map ff'tg. 4) of the cDN A 
n^Itt of one of these (pDHFR?) was constructed. 

OX 926^ 866 £19 



Examination of the aminoaddsequen^o^^^^^^^^^ 
25' Xf iSS^structural sequence to a positton 

analysts ot the ammo ^^'f.^^"t^^„^„';2ed direct verification 
T^rZS^l^n^^SlT^Zt corresponds to the' 
that the nucleotide DHFK enzvine, 

amino add s^uence reponed fw^^^^ 

ami ^'f !™^KA^^^^^^^^ end of the mRNA. 

ocated at the ds cUISA equivaic .„„,22-cDNA iunct oit 

nearest the ^^""^ °/ * p^f^^^fifof special interest. The 
cDNA (that ^]-^'!'^^!';,,'^^^ p-laiSttsase gene of the 

vector (J. G. Sutclitte, P®'~"^, rl . . ^^^^ (,„ ,he terminal 

,he i>«l site --'^ ^fiiSS^cdt^ly by (1) an ATG 
transferase, and these are to'»J^" , j j^„t amino 
(AUG)pro.e.nst^cgton,and^^^^^^^ 

acid of the mouse '^^f „rientati a^s at tliat 

:fcS£-^^S:^~*erecrp.astnid;howev^ 
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number of incremental G residues (that is, 10) at the vector- 
insert junction ensures that the DHFR cDNA sequence is not m 
the same transiationa! reading frame as the p-!aclamase £cne. 

If the phennlypic expressiun we have observed for the niouse 
DHFR sequences in bacteria is tlie result of transiationa] 
readthroiigh from signals that initiate protein chains within the 
P-lactamase gene, then the host bacterial cell must be able to 
circumvent the observed frame shift by a 'slippage' niechanism 
of translation. Such an event might potentially be aided by the 
long run of dG residues introduced at the pBR322-cDNA 
junction and could account for our observation (Table 1) that 
plasmids conlainine a DHFR cDNA insert yield more than t»'ice 
as many expressors as would be expected from considerations of 
reading frame and orientation. However, the high level of 
functional expression observed for both primary and second- 
arily transformed clones of pDHFR7 does not seem to be 
readily explained by slippage of tRNA molecules during 
translation- ^ „ r , n 

Perhaps a more likely explanation is that the ftU-pojyU- 
ATG sequence that has been constructed preceding the coding 
sequence for DHFR serves as a binding and protein initiation 
site for the bacterial ribosome. Recent studies have 
identified sequences on mRNA ij5 the 5' direction froro the 
initiator codon that arc complementary to (he CCUCC 
sequence at the 3' end of the 16S ribosomal RNA species, 
proposed by Shine and Daigamo^' to be involved in the binding 
of mRNA to ribosomes. 11 is tempting to speculate thai the 
mRNA transcript from the sequence at the pBR322-cDN.'V 




Rg. 3 Detection of colonies coniaining DHFR cDNA inserts by 
insilu hybridisation. Colonies were screened fof DHFR sequences 
using a modiBcBtion (G. N. DaeB, unpublished) of an in siiu 
hybridisation procedure". Tc-t«sistant colonics were transferred 
to nitroceUulose filtera (MflUporc, HAWG) that had been placed 
on Pcnassay broth agar plates containing Tc (10 iig per ml). 
(Filters had been washed twice by txjiling in H2O and autodaved 
Iwtlorc being placed on plates.) After 2-3 d of bacterial growdi al 
32'C, the filter was removed trran the plate and placed on a 
Whatman no. 3 pad saturated with 0.5 M NaOH. After 7 roin, Uis 
filter was sequentially uimsIerTed to a series of similar pads 
saturated with I M Ttis, pH 7.4 (twice, 7 min each); 1 .5 M NaCI. 
OSM Tris, pH 7.5 (once. 7 min); and 0.30 M NaQ, 0.03 M Na 
citrate (2xSSC) (ouce, 7 roin) After the excess liquid had been 
removed by suction, the filter was placed on a pad containing 90 » 
cdiancl. dried by suction and baked in vucuo at 80°C for Zh. 
Before hybridisation, filters were pretreated for 3-6 h at 6-5*C ra 
hybridisation buffer that contained 5xSSC, pii 6.1 0.2% SDS, 
0.027O l icoll 400 (Pharmacia) and 8(igmr E. coli tRNA. 
Hvbridisaticws were carried out with individual filters in 1.5 10] 
hybridisation buffer conlainiog 2x10' c.p.m. "P-labeDed purified 
DHFR cDNA' ' in a sealed plastic bag at 65 "t: tor 24 h. The filters 
were then washed in hybridisation buffer (once, 60 min al 65 "Cy, m 
5 X S.SC, pH 6.1 (three times, 60 min each at 65 °C); and in 2 x SSC. 
pH 7,4 (twice, 10 roin at room temperature), air dried, and pre- 
pared for autoiadiogtaphy. Ufc top, a coUeoioa of Arl77fi 
colonies which coniain a DHFR cDN A insert; middle, Tc-resistant 
y-22S2 colonies derived from transtrarmation with annealed 
pBR322 cb cDNA— both reacting and non-ieacting colonies arc 
seen; bottom, colonies containing pBR322 and pACYClOl plas- 
mids which show no visible hybridisation. Right: several 
positive colonies on a representative filter analysed in screening 
v.n76 liamformants. Negative colonies ncprcscnt clones 
containing pBR322 or pACYClOl plasmids which show no visible 
hybridisation. 
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junction has sufficient complementarity to the CCUCC 
sequence to allow ribosomal binding when a transiationa! start 
signal is located an appropriate distance away. In such an event, 
the ATG protein start signal that immediately precedes the 
coding sequence for the mouse DHFR might initiate a peptide 
chain having a si^e characteristic of the mammalian enzyme. 
Immunological analysis of exlracts derived from pDHFR? and 
other expressing clones has yielded results consistent with this 
interpretation (see below). 

Analysis of enzyme activity encoded by 
pDHFR7 plasmid 

Mammalian dihydrololate reductases can be distinguished from 
their bacterial counteiparts by the ability of mammalian 
enzymes to use folate as a substrate and by their differential 
sensitivity to competitive inhibitors"'"', fn initial experiments, 
the reduction of folate to letrahydrotolate was measured using 
extracts from the pDHFR7 clone, from a Tp-sensitive clone 
containing a DHFR cDNA insert (the pDHFR 1 0 plastnid), and 
from ceUs that contain only the pBR322 vector. Although all 
three clones are capable of synthesLsing a chromosomally 
produced bacterial enzyme, only the enzyme present in extracts 
from ceils containing the pDHFR7 plasmid gave reducrion of 
folate (4 X background). Additional evidence tliat the reductase 
encoded by the pDHFR7 plasmid is of mammalian origin was 
obtained by inhibitor analysis (Fig. 5). The DHFR isolated 
directly from mouse cells and the activity encoded by the 
pDHFR7 plasmid showed identical sensitivities to methotrex- 
ate, triiiKthoptim and a triazine derivative (2,4,-diamino-l-(4'- 
butylphenDl)-fi,6 dimethyl- 1, 6-dihydro-l,3,5-triaziiie); both 
enayrae activities were 200 times more sensitive to the triazme 
than to trimethoprim. In contrast, bacterial dihydrofolatc 
reductase is inhibited more effectively by triinethnpnm (K - 
5 X 10"') than it is by triazine (K; = 6.5 x lO'")'-. 

As methotrexate binds stoichiomenrieally to dihydrofolatc 
reductase", we can estimate the number of molecules of enzyme 
in the pDHFR7 plasmid extracts from the methotrexate inhibi- 
tion date of Fig. 5. We calculate from the specific activity of the 
extract (3 units per mg of soluble protein) and the specific 
aclivity and methotrexate binding parameters of the mouse 
enzyme^ that 0.01% ol the soluble bacterial protein is active 
mammalian DHFR. 

Ijnnnmological characterisation of bacterial 
cell exlracts containing mouse DHFR 

Immunological evidence confirming the nature of the DHFR 
encoded by pDHFR? and other plasmids that contain a mouse 
DHFR cDNA insert was obtained using a solid-phase sandwich 
radioiraraunoassay'l Tp"* clones of x2282 containing the 
independentlv derived plasmids pDHFR 7, 12 and 13-showed a 
strong reaction with rabbit antibody directed against mouse 
DHFR in an in sim immunoassay'* (data not shown); protein 
that reacted with the antibody was also made by baclena which 
showed low levels of phenotypic expression and by some clones 
that did not mal<e a biologically functional DHFR (that is, were 
Tp sensitive). The nature of the antigen synthcsised by Tp ana 
Tp" cioimes was examined nwre fully using a newly developed 
method (filter affinity transfer, or FAT procedure) lor the m aVi 
immunological characterisation of proteins in gels' . this pro- 
cedure depemfc on the covalcnt coupling of F(ab)i antibody 
fragments to a chemically derivatised and activated cellulose 
filter; antigen transferred on to the filter from an SDS -poly^ 
acrylamitfe gel is detected by subsequent incubations wilh 
antiserum and '"I-labelled Smphylococcus aurew: protem A 
(ref. 12). ^. , 

Filter affinity transfer analysis of the pDHFR7 extract (Fig- 0, 
lanes £>, c) shows the presence of protein that reacts immuno- 
logically with the antibody to mouse DHFR and ftirther sho«« 
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1 dG-C IcDNft INSERT " r, »=„«.-r.vlleGlyLysA5"GlyMP"u?"- 

TAILED ' i „ .„ i.,Pi,<,TleVaiAlaValSerKlr.AEnHeifi!yil.'=i»J.yi-y»"= 

-ri^/crT™^'-'^^ 252 
■ ■ ■ — -fr Hinfl ... 

''''' "^^^^ 

■ ACCcncArrrCA^cGTACCA^ 
253 Alul 

121 

fTTCluGluMaJIecAsn ... . 

lAC-^iGGMCCCATOAAT . ■ ■ 
ATG -IXCrtCGSTACnA ■ ■ • 

361 

d ' fia! CH*JAse<)UCncc of cDNA insert 

^^^^'^-"^I^i^T.lltn':.' nthcS th. »RHA horn .positioel''!" 



X acids ar. numbered from 'If ^1 wcS ^^^'^c method of M^^-^/f^' ^ to Kpho;^ 6% acrytem.'* S^' 
and TBE butter''. Fragments a, i> and . «rre ^'^.^^^^^ ^^^^^^^ described in J=t> «) and 
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bacterial cells that ^-" ^"^^^^^Zfiis^P^^ of coding for a 
1,500-base pair "^^i^t P'^.'^'^w we conclude that the 
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A ,^-mn„^e DHFR was detected in extracts of cells 
:atmg^ta:e7BR?2?ror(F..6,.anc/). 

Variation in level of expression of mouse 
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Fis. 5 Inhibitor analvsis of DHFR from bacterial cells. 8131.01)317 
phase cultures of x2282 expressing trimethoprim reaslaiKe were 
grown in the presence of Tp (1 mr') in mininial medium, 
washed with isotonic saline, and suspended in 50 niM potas.«um 
phosphate buffer. pH 7.0, containing 10 inM ben^amidiiie and 10 
!pM phenyl methyl sulphonyl fluoride (3 volunnes boBer to 1 
volume cells). The suspension was sonicated and centrifugcd at 
10 OOn r p m for 15 min- The supernatant was centrifugcd lor 1 h 
at 'l0O.O00g before being studied. An melliotrexate-resisiant 
mouse cell extract was prepared as described elsewhere . EMyine 
activity was measured by the radioactive folic acid assay previoud^ 
described^^. Protein was determined by the method of Lowry . 
Approximately 3 units of activity from the x2282 ettraet or 5 un^ 
fr^ the metbotieme-resistant mouse cell cxiraa were mcubated 
with inhibitor for 10 min at 24°C before assaying for folate 
reductase aclivity; the cuncentralions shown represent the hnal 
concentration of inhibitor in the reaction nuxture. Background 
values, determined by measuring enzyme aoivily in the presenoeof 
10 mM melhotnjxate, have been subtracted from all P"""^.'?;^ 
results presented are the average of duplicate samples whrch 
generally varied by less than 10% and are enpressed asa pereen- 
Wge of the value obtained in the absence of inhibitor. Oi« unjt of 
aiSvityistlieamoiiat of enzymeneededio reduce I nmolof fotolein 
] 5 min at iTC. A and A indicate addiuon of methotrexate, Oand 
• indicate addition of the triazine derivative, and D and ■ indicate 
trimethoprim addition for x2282 and the mouse cell extracts, 
respectively. 



react with antibody to mouse DHFR (Fig. 6 and uapiihlished 
data). It Ifi particularly inttaresting that the clone carrying 
nDHFR25 expresses a low level of Tp resistance, although the 
coding sequence for the DHFR enzyme is inserted in an orien- 
tation opposite to that of the p-lactamase geae. This finding, and 
oar detection in extracts of the pDHFR25 clone oC protein that 
reacts immunologically with aolibody to the mouse enzyme 
(unpublished data), suggest that readthrough transcription mto 
the DHFR coding sequence from a promoter sequence located 
dn or near the distal segment of the ^-lactamase gene may occur. 
Consistent with this interpretatton are preliminary data suggest- 
ing that DHFR antigenic sites are abo synthesised by cells 
carrying pDHFR23, which is a non-expressor of Tp resistance 
and contains a cDNA insert in orientation b (Fig. 7). Further 
study is required to determine whether sequences in the distal 
segment of the ^-lactamase gene are capable of serving as weak 
promoters for the initiation of mRNA chains Aaf extend into 
the DHFR cDNA. 

The findings reported here indicate that the bacterial clones 
we have cotistructcd arc synthesising and phenotypically 
expressing DHFR encoded by mouse cDNA sequences; (1) the 
cDNA insert cloned in bacteria has been shown by m suu 
hybridisation to be homologous with the mouse gene and by 
direct DNA sequence analysis to encode the amino aad 
sequence of mouse DHFR, (2) DHFR enzymatic activity and 
resistance to Tp are specified by nucleotide sequences present 
on chimaeric plasmids but not on the vector, (3) the DHFR 



Re. 6 Rller affinity transfer analysis'" erf bacterial ceU extracts. 
20 H.1 of extracts in SDS sample buffer were run at constant current 
tor 3 h in an 1 1 .25% SDS-polyactylamide slab gel. TTic ge was 
incubated m PBS (50 mM phosphate buffer ^^=7^ 
NaQ) for 30 rain and placed on a blotter wet with PBS. PepUdes 
were specifically transferred from the gel to strips ^ dryceUulose 
filter that liad been ccrvalenlly coupled to anti-DHFK h(aD)2 
fiasments'^ Fillets were washed, incubated J-'iMJ '<> 
DI&R, washed again and treated with "'l-labeUed protein K 
After additional washmg and dryingsteps, the filters were analysed 
by autoradiography. THe eloate fracHon Pane w^ob'«"«' ^y 
passage of an extract of x2282 ceils containing pDHFR7 o«r a 0.5 
m! meihotiexat^pharose affinity column. THe extract was 
acidinsd to pH 5.8, passed over the column and the bannd fraction 
was ehiied iith 2 mM folic arid in a 5 mM NallCp, buffer a pH 
8.5 as described elsewhere^'. Lane a, extracts mouse eel line 
lanes b and c. pDHFR 7; lane ieluale from methotrexate column, 
lane e, pDHFR21v lane /, pBR322. 



insert to the p-lactama.se gene sequence of the vector the length 
of each insert, and the minimal inhibitory concentration (MIC) 
determined for the done are shown in Fig. 7. As can be seen, 
piasmids pDHFR7, 12, 13 and 25-29 all contain a cornpkte 
DHFR structural sequence inserted in the same orientation 
(that is, orientation a) as the gene encodmg the bactenai jJ- 
lactamase. lite clone carrying each of these plasmids expresaes 
Tp resistance, although the MIC varies from 150 ,tg ml tor 
PDHFR28 to >1.0(X1 p.g ml ' for pDHFR7. 12 and 13; the 
greatest reactivity with antibody to mouse DHFR occurs mxb 
pDHFRI2 (unpublished data). . j . ■„ 

It is unlikely that translational reading frame is the determin- 
ing factor m the different levels of expression observed m these 
clones, as our DNA sequence analysis indicates that a correct 
reading frame is not essential for efficient expression in pDHl-K 
7 12 or 13 However, the positioning of the putative ribosomal 
binding site in relation to the ATG start codon may potentially 
influence the strength of expression by affecting the fonnahoa o 
the translational initiation complex (compare wifk re(. ii)^l<igs 

The end of DHFR structural sequence that 
5' end of the mRNA is not present in piasmids pDHFR 21 ana 
24 thus explaining the observed lack of functional expression ot 
the cDNA in these clone.-!. However, pDHFR21, whidi has lost 
less than 1";% of tlie structural sequence, nevertheless encodes a 
(probably hybrid) peptide that contains antigemc sites which 
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N Galaxies— a new dass of X-ray sources 

BURBIDGF.', classified cxtragalactic objects with bright nuclei 
into thret classes: in order of iiiaeasing optical luminosity these 
are (1) Seyfert galajiics; (2) N galaxies; (3) quasars. Many 
Sevterts and several quasars have been shown to be X-ray 
sources. Here we show that N galaxies are also powerful X-ray 
sources. In fact, all six N galaxies in the 3C radio catalogue witb 
rcdshifts les.s than 0.06 (ref. 1) are detected by the first fuU-sky 
.survey of the Goddard Space Flight Center detectors (A2) on 
HEAO 1 X-ray emission has also been discovered from a strong 
Southern Hemisphere radio souro;, the N galaxy Pic A. Sue of 
the seven objects are classified as broad-line radio gaiaxiM 
(BtRGs)" and one (3C371) is classified as a BL l.ac object . 
These objcct-s are differentiated from the newly discovered dass 
of 'emission Une' galaxies' by their high radio flux and extremely 
broad lines. Before HE AO 1 there was one confirmed N galaxy, 
3C120 (ref. 8), and one suggested. 3C390.3 (refs 9 10). We 
detect 3C120, streng*en the identification of 3Cjyu.j, ana 
orcscnt evidence tor X-ray emission from four new sourc^, 
301 1 1, 3C382, 3C371, and Pic A. We also suggest 3C445 as the 
identification of 2 A2220-022 (lef. 11). 

Fimrc 1 shows 90% confidence enor boxes on the positions 
of these seven galaxies together with 4U (ref. 10) or 2A (ref 11) 
error boxes where applicable. The 2-10 keV luminosity of these 
obiects, as well as other relevant information, is given in Table I. 
We have also looked for X-ray fiux from the next closest N 
galaxies FKS0521 - 36 and 3C227. No significant Bux was seen, 
but the 90% upper limit on the X-ray luminosities (given in 
Table 2) are consistent with their luminosities being comparable 
to the seven detected N galaxies. Except for quasars, these N 



galaxies ate the most luminous class of compact X-ray sources 
vet dctcctBtJ 

Grandi and Osterbrock have classified radio galaxies spec- 
trosoMjiqaBy as BLRG or narrow-line radio galaxies (NLRG) 
and have demonstrated a high degree of association between the 
classification 'broad-line' and the morptological ^ . In 
marked contrast to the situation Cor BLRGs. only 3(,40> t^yg 
A) of the five NLRGs lUted by Grandi and Osterbrock, with 
redshifts less than 0.06, is a detected X-ray source. 3C3n is 
confused with Ab2052, a distance class 3 cluster, and sojM 
luminosity is poorly determined. Upper limits on the flux troffl 
the other threTNLRGs are given in Table 2. The dedxiced upper 

Umits on the luminosities are eonsider^ly leffi than average 
X-rav luminosities for BLRG of ~3 x 10" erg s V Note that of 
the delected N galaxies, the one with the narrowest emission 
lines, 3C371. has the smallest X-ray luminosity. 

lack of X-ray emission from NLRGs contrasted to the 
virtual certainty of X-ray emission from BLRGs is similar to th« 
distinction between type 1 and type 2 Seyferts. that 's.nonew 
the -20 known X-ray emitting Seyfert galaxies are classical type 
2 Seyferts. Grandi and Osterbrock have drawn atlenUon to the 
optical spectral similarities and differences between BLRGs ana 
twe 1 Seyferts, and NLRGs and type 2 Seyferts. respectively. 

The fact that many of these N galaxies possess compact radio 
components suggests that the synchrotron self-Compton pro- 
cess (SSC)" may be the prindpie cause of the X-ray emi^iW. 
For 3C390.3. the best studied of these objects, a rn™^ 
predicts a physical size to the X-ray e^«''^S/tl'¥l° 
md a Ume scale for X-ray variability of "1°"*^ "7^' f '5*° " 
of 3C390.3 by Uhuru and Copernicus at consider^ly hi^f^ 
flux levels and an Ariel 5 upper limit on 3C382 (ref . 14) « 
lower than the present detection indicate possible variability m 
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Chicken ovalbumin is sj^thesized and secreted by Escherichia coli 

(recombinant DNA/gene expression//ac control/plasmids) 



Thomas H. Fraser and Barbara J. Bruce 

Research Laboratories, The Upjohn Company, Kalamazoo, Michigan 49001 
Communicated by Alexander Rich, September 22, 1978 

ABSTRACT By recombinant DNA methods, the chicken 
ovalbumin structural gene has been fused to Escherichia coli 
lac transcriptional and translational control regions. When a 
plasmid containing the hybrid gene was introdu<^ into K cob, 
a protein identified as ovalbumin by immunoreactivity and 
sodium dodecyl sulfate/pdlyacrylamide gel electnq^resis was 
synthesized. The chicken ovalbumin made in bacteria was full 
length (43,000 daltons) and constituted 1.5% of the cellular 
protein. In addition, the microbially synthesized ovalbumin was 
secreted through the cell membrane into the periplasmic space 
of K coli. The ability of the K coli secretory apparatus to rec- 
ognize chicken ovalbumin, which is normally synthesized and 
secreted in hen oviducts, suggests that common features exist 
in the secretion-recognition mechanisms found in these two 
organisms. The bacterial synthesis of significant amounts of 
chicken ovalbumin demonstrates that the £. coli cellular ma- 
chinery may be utilized to synthesize a higher eukaryotic pro- 
tein which is relatively stable in the bacterial intracellular en- 
vironment. 



Many of the potential benefits envisioned as a result of the ap- 
plication of recombinant DNA technology to the solution of 
medical problems require the insertion, into microorganisms, 
of genes coding for proteins normally found in higher eukar- 
yotic organisms. Although it has been possible to clone a number 
of different higher eukaryotic genes containing the information 
necessary to code for proteins, reports of the expression of these 
proteins in bacteria have been limited to the htmian polypeptide 
hormone somatostatin (1) and rat proinsuliii (2). 

The somatostatin gene was chemically synthesized and the 
DNA sequence coding for its 14 amino acids was fused to the 
/3-galactosidase structural gene on a plasmid. Yields of soma- 
tostatin varied from 0.001 to 0.03% of the total ceUular protein. 
The rat proinsulin gene was inserted into a plasmid-bome 
penicillinase gene and the bacteria produced approximately 
100 molecules of proinsulin per cell. 

We report here the in vitro construction of a multicopy 
plasmid in which the lac operon control region was fused to a 
DNA sequence coding for chicken ovalbumin, a 43,000-dalton 
protein. This egg white protein is normally synthesized and 
secreted in the chicken oviduct. Bacteria containing the fused 
plasmid synthesized about 1.5% of their protein as full-length 
immunoreactive ovalbimiin. The production of ovalbumin has 
been shown to be under lac control and the protein is stable in 
vitro to Escherichia coli proteases. Because E. coli is capable 
of secreting specific proteins, we also performed experiments 
to determine whether the E. coli secretory machinery can 
recognize the ovalbumin molecule and found that ovalbumin 
is secreted through the cell membrane into the periplasmic 
space. 



The publication costs of this article were defrayed in part by page 
charge payment. This article must therefore be hereby marked "ad- 
vertisement" in accordance with 18 U. S. C. §1734 solely to indicate 
this fact. 



MATERIALS AND METHODS 

Cells and DNA. E. coli HBlOl (UC 6479) (3) was used in all 
experiments. The lac UV5 promoter plasmid pOP203 was 
constructed by F. Fuller, L. Johnsrud, and W. Gilbert (personal 
communication) by cloning a 203-base-pair Hae III fragment 
of lac DNA containing the promoter, ribosome binding site, and 
codons for the first seven amino acids of /3-galactosidase into 
the tetracycline resistance plasmid pMB9 (4). The plasmid has 
a single EcoRI site located so that the lac promoter will direct 
transcription into any DNA fragment l^ted into that site. The 
pOV230 plasmid was constructed by L. A. McReynolds, J. F. 
Catterall, and B. W. O'Malley (5) and was obtained in a coop- 
erative laboratory arrangement. Plasmid DNA was prepared 
by the salt precipitation method of Guerry et al. (6) and further 
purified by cesium chloride/ ethidium bromide density gradient 
centrif ugation (7). Transformation'was done essentially as de- 
scribed by Wensink et al. (8). Cell extracts were prepared by 
grinding with alumina, and protein concentrations were de- 
termined by the method of Lowry et al. (9) with bovine serum 
albumin as the standard. 

Enzyme Reactions. All restriction endonucleases were 
purchased from New England BioLabs. EcoRI, Taq I, and Hha 
I digestions were done in reaction mixtures recommended by 
the supplier, except that autoclaved gelatin was substituted for 
bovine serum albumin. Alkaline phosphatase treatment of 
EcoRI-cut pOP203 DNA was carried out essentially as de- 
scribed by Ullrich et al. (10), except that the enzyme was 
preincubated at 70"'C for 10 min prior to addition of DNA. T4 
DNA polymerase-catalyzed fill-in of staggered ends was done 
in a reaction mixture described by Goulian et al. (11), con- 
taining 0.2 mM deoxynucleotide triphosphates. 

EcoRI linkers, obtained from Collaborative Research, were 
phosphorylated by using polynucleotide kinase (12) and then 
heated to 70°C and slowly cooled to 4°C. The reaction mixture 
was then brought to 45 mM Tris-HCl (pH 7.8), 10 mM 2-mer- 
captoethanol, 10 mM MgCk 15 mM dithiothreitol, 1 mM ATP, 
20 ng of Taq I-cut pOV230 DNA per ml, and 14 units of T4 
DNA ligase (New England BioLabs) per ml and incubated at 
12.5°C overnight. Joining of the purified ovalbiunin gene 
fragment to the alkaline phosphatase-treated pOP203 was also 
catalyzed by T4 DNA ligase (12). 

Gel Electrophoresis. DNA was extracted from agarose gels 
by maceration and overnight incubation at 47°C in 10 mM 
Tris-HCl, pH 8.0/2 mM EDTA/1 M NaCl. The agarose was 
pelleted at 100,000 X g for 1 hr and the supernatant was phenol 
extracted, ether extracted, and ethanol precipitated. 

Sodium dodecyl sulfate (NaDodS04)/polyacrylamide gel 
electrophoresis for protein separation was done as described by 
Laemmli (13) with a Bio-Rad slab gel apparatus. Samples were 
dissolved in an equal volume of sample buffer [61.5 mM Tris- 
HCl, pH 7.8/3% (wt/vol) NaDodS04/5% (vol/vol) 2-mer- 



Abbreviation: NaDodS04, sodium dodecyl sulfate. 
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captoethanol/20% (vol/vol) glycerol] and heated at 100°C for 
5 min (14). 

Immunoassays. The in situ immunoassay was done as de- 
scribed by Skalka and Shapiro (15), with the N-Z bottom agar 
containing 25 /xl of ovalbumin antisera (Antibodies, Inc., Davis, 
CA) per ml, 10 /xg of tetracycline per ml, and 1 mM isopro- 
pylthiogalactoside. 

Immunoreactive material in cell extracts was precipitated 
for gel analysis by adding antiserum to extracts. The mixtures 
were allowed to stand for 60 min at room temperature and then 
overnight at 4°C; the precipitates were washed twice with 25 
mM Tris-HCl, pH 7.4/137 mM NaCl/5 mM KCl. 

Containment. All recombinant DNA ej^riments were done 
under P2-EK1 containment. 

RESULTS 

Construction of the /oc-ovalbumin fused plasmid, pUClOOl, 
proceeded as shown in Fig. 1. In our construction we cut the 
pOP203 plasmid with EcoRI and treated the resulting linear 
molecule with alkaline phosphatase, removing the 5'-phosphate 
groups to prevent recircularization of the pOP203 vector DNA 
during hgation (10). 

The pOV230 plasmid was constructed with a pMB9 vector 
and contained nearly all of the ovalbumin mRNA sequence, 
including all of the information required to code for the amino 
acid sequence of chicken ovalbumin. The sequence of the 
ovalbumin gene insert in pOV230 (16) revealed a unique Taq 



ovalbumin 




jjaq I digestion 



pCGA T 

T AGCp 

1. T4 DNA polymerase 

2. T4 DNA ligase 

-•■ Eco Rl linkers 



pGGAATTCCCGA TCGGGAATTCC 

CCTTAAGGGCT AGCCCTTAAGGp 

1. Hha I digestion 

2. Eco Rl digestion 

3. Gel purification 



pAATTCCCGA TCGGG 

GGGCT AGCCCTTAAp 

T4 DNA ligase 

* Eco Rl cut pOP203 




Fig. 1. Outline of steps used to fuse /ac control elements to the 
ovalbumin gene, resulting in plasmid pUClOOl. 
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I restriction endonuclease site 25 base pairs to the 5' side of the 
ovalbumin initiator AUG. Fusion of bacterial control regions 
at this site would allow expression of the entire ovalbumin 
structural gene sequence. As shown in Fig. 1, an additional Taq 
I site is located approximately 250 base pairs outside of the 
ovalbumin insert such that Taq I digestion yields a DNA frag- 
ment of about 2200 base pairs containing the entire ovalbumin 
structural gene. 

Because the sequences of the lac control regioninserted into 
pOP203 (17, 18) and of the ovalbumin gene insert in pOV230 
are known, our strategy was to fuse the lac and ovalbumin 
DNAs in a manner that would maintain the translational 
reading frame, assuming protein synthesis began at the /S-ga- 
lactosidase initiation codon. There are several ways in which 
this can be done, but we chose to fill in the stagger-ended Taq 
I fragments with DNA polymerase and then ligate synthetic 
EcoRI octamer linkers to the blunt ends as shown in Fig. 1. A 
large excess of linker molecules was used in the reaction to 
prevent ligation of the filled-in Taq I fragments to each other. 
The hnked fragments were digested with Eco Rl to generate 
5' staggered ends. Gel purification was facihtated by digestion 
with Hha I to cut a pOV230 fragment that otherwise migrated 
with the ovalbumin gene. 

After elution from the gel, the purified pOV230 DNA 
fragment was ligated to the alkaline phosphatase-treated 
pOP203 plasmid vector. This ligated DNA was then used to 
transform E. coli HBlOl, and tetracycline-resistant colonies 
were selected. The transformants were assayed for ovalbumin 
production by an in situ immunoassay. Colonies were lysed 
with lysozyme and Sarkosyl after growth on agar containing 
ovalbumin antiserum, isopropylthiogalactoside, and tetracy- 
cline. Easily discernible precipitin rings formed around several 
of the colonies, indicating that they were producing substantial 
amounts of ovalbumin. 

In order to verify the key features of the plasmid DNA 
structure in the ovalbumin-producing bacteria, plasmid DNA 
from eight transformants was subjected to restriction digestion 
with Hae III alone and with both Hae III and Eco Rl and ana- 
lyzed by polyacrylamide gel electrophoresis. Fig. 2 illustrates 
the expected Hae III and Eco Rl cleavage sites in our con- 
structed plasmids. Construction A would be expected to syn- 
thesize ovalbumin whereas construction B would not be ex- 
pected to synthesize it. Hae III digestion would allow deter- 
mination of the orientation in which the ovalbumin gene is 
inserted relative to the lac control region; Hae III/EcoRI double 
digestion would show whether both EcoRI sites are present. 

Fig. 3 shows the results of these restriction digestion analyses 
for eight transformants. Lanes 17 and 18 contained the Hae III 
digest and the Hae III/EcoRI double digest of pOP203. The 
largest fragment in lane 17 contained the lac control region and 
was 1052 base pairs long. This fragment was cut by EcoRI to 
yield the two fragments, 350 and 700 base pairs, seen in lane 
18. 

Lanes 1, 3, 5, 7, and 9 contained plasmid DNAs in orientation 
A (Fig. 2); lane 11 contained plasmid DNA in orientation B. As 
shown in lanes 2, 4, 6, 8, 10, and 12, these transformants all 
contained two EcoRI sites. 

Lane 13 contained a plasmid in which a deletion apparently 
occurred and could be explained if the krger band in a plasmid 
with the ovalbumin gene in orientation A had lost some D'NA, 
giving rise to the observed doublet at about 1600 base pairs. The 
double digest in lane 14 indicates that one of the EcoRI sites had 
been lost, suggesting that a deletion in the 1800-base-pair 
fragment included the EcoRI site. Lanes 15 and 16 contained 
a plasmid in which a deletion occurred in the pOP203 vec- 
tor. 

The in situ immunoassay results showed that colonies con- 
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^ dA'dt ovalbumin lac 

1 — r-^ t =f r 

Hae III Eco Rl Hae III Eco Rl Hae III 
350 *< 1450 840 x 700 *■ 



B ovalbumin dA-dT , 

"! t t ' r 

Hae III Eco Rl Hae III Eco Rl Hae III 

« 1190 ^ 2150 *■ 

Fig. 2. Restriction digestion maps of plastnids containing the ovalbumin gene, showing the two possible orientations of that gene relative 
to the lac control elements. 



taining the plasmids whose Hae III digestion patterns are shown 
in lanes 1, 5, 7, 9, and 13 of Fig. 3 produced ovalbumin. All of 
these plasmids had the ovalbumin gene inserted in orientation 
A. The plasmid in lane 3 also was in orientation A but did not 
produce ovalbumin, p)erhaps due to a frame shift caused by the 
addition or deletion of bases during the biochemical manipu- 
lations required for fusion. 

Fig. 4 is a photograph of a Coomassie blue-stained Na- 
DodS04/polyacrylamide gel in which several standards as well 
as immunoprecipitated ovalbumin had been electrophoresed. 
Lane 2 contained an anti-ovalbumin immunoprecipitate of an 
E. coli HBlOl (pUClOOl) extract. The top band is immuno- 
globulin heavy chain (50,000 daltons); the bottom band is im- 
munoglobulin light chain (23,500 daltons). The dark band be- 
tween the heavy and light chains in lane 2 is the E. coli oval- 
bumin. Although several other very faint bands are visible, it 
is clear that the immunoprecipitation is specific. 

Lane 1 in Fig. 4 contained an immunoprecipitated E. coli 
HBlOl (pUClOOl) extract that had been subjected to incuba- 
tion for 4 hr at 37°C. The size of the E. coli ovalbumin was not 
reduced by this self-digestion. Lanes 3, 4, and 5 contained 
protein standards to assist in the determination of the molecular 
weight of E. coli ovalbumin. Lane 3 contained alkaline phos- 
phatase (43,000 daltons), lane 4 contained purified egg white 
ovalbumin, and lane 5 contained periodate-treated (19) egg 



white ovalbumin. Whereas the untreated egg ovalbumin mi- 
grated slightly faster than the E. coli ovalbumin, gentle treat- 
ment with periodate, which should remove carbohydrate 
residues, caused them to migrate at the same rate. 

By using a gel diffusion immimoassay with egg white oval- 
bumin as a standard it was found that between 1.35 and 1.6% 
of the cellular protein was ovalbumin. Spectrophotometric 
scanning of Coomassie blue-stained NaDodS04/polyacryl- 
amide gels also indicated that about 1% of the cellular protein 
was ovalbumin. 

We found about a 50% decrease in the amount of ovalbumin 
produced in the absence of isopropylthiogalactoside, indicating 
that synthesis is under lac control. We expected only a small 
decrease because the multicopy pUClOOl plasmids contain 
functional lac operator, resulting in nearly constitutive levels 
of transcription in the absence of inducer. 

To determine whether E. coli secreted ovalbumin, bacteria 
containing the pUClOOl plasmid were grown in the presence 
of isopropylthiogalactoside and tetracycline. The cells were 
treated with lysozyme and EDTA to give protoplasts. Cen- 
trif ugation of the protoplast preparation yielded a supernatant 
fraction containing periplasmic proteins and a pellet containing 
protoplasts. ^-Galactosidase was used as a marker for nonse- 
creted proteins and alkaline phosphatase was used as a marker 
for periplasmic proteins. 






Fig. 3. Polyacrylamide gel electrophoresis of plasmid DNA 
restriction digestion fragments. The odd-numbered lanes are Hae 111 
plasmid digestions and the even-numbered (n -I- 1) lanes are the same 
plasmid subjected to Hae lll/EcoHl double digestion. 



Fig. 4. Coomassie blue-stained NaDodS04/polyacrylamide gel. 
Lanes: 1 and 2, E. coli extracts precipitated with ovalbumin antise- 
rum; 3, alkaline phosphatase; 4, purified egg white ovalbumin; 5, 
periodate-treated egg white ovalbumin. 
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Table 1. Results of protoplasting experiment 





^TGalactosidase 


Alkaline 


Ovalbumin 




activity 


phosphatase activity 


immunoreactivity 


Fraction 


Units % total 


Units % total 


/ig % total 


Periplasmic 


1,005 3.7 


32.24 73.4 


620 48.6 


Protoplast 


26,189 96.3 


11.7 26.6 


656 51.4 



Protoplasts of HBlOl (pUClOOl) were prepared essentially as described by Malamy and Horecker 
(20). /3-Galactosidase and alkaline phosphatase were assayed as described (21, 22). Units are defined 
as the enzyme activities required to produce an absorbance change of l.O/hr; the units shown represent 
the total activities from a 600-ml broth culture. 



Table 1 shows the results of a protoplasting experiment. More 
than 96% of the j8-galactosidase activity was in the protoplast 
fraction, indicating less than 4% leakage of this nonsecreted 
protein into the periplasmic fraction. Less than 75% of the al- 
kaline phosphatase activity was in the periplasmic fraction, 
suggesting that some of the periplasmic protein is not released 
into the supernatant. Table 1 shows that approximately 50% of 
the ovalbumin was found in the periplasmic fraction and 50% 
in the protoplast fraction. In view of the extremely low level 
of j8-galactosidase leakage into the periplasmic fraction, this 
indicates that the chicken ovalbumin was actively secreted by 
E. coli, although perhaps not as efficiently as E. coli secretory 
proteins such as alkaline phosphatase. In no case was any im- 
munoreactive ovalbumin found in the grovvth medium. Na- 
DodS04/polyacrylamide gel electrophoresis of anti-ovalbumin 
immunoprecipitated periplasmic and protoplast fractions has 
shown that the secreted and nonsecreted ovalbumin have the 
same molecular weight. 

DlSCUSSIOiy 

We have shown that full-length ovalbumin molecules are 
synthesized in bacteria from information encoded in the 
pUClOOl plasmid. Because the E. coli ovalbumin expression 
from pUClOOl is induced by isopropylthiogalactoside, we 
believe that ovalbumin mRNA synthesis is initiated at the 
plasmid lac promotor. From Fig. 5, showing the first 107 bases 
of this mRNA, it can be seen that if protein synthesis is initiated 
at the /3-galactosidase AUG (position 39) the ovalbumin mole- 
cule wall be synthesized with 18 extra amino acids on its amino 
terminus. Addition or deletion of bases during the in vitro 
construction of pUClOOl would result in a frame shift, and 
ovalbumin would not be expressed. This seems to be the most 
likely explanation for lack of expression from the plasmid in 
lane 3 of Fig. 3 and, if true, would demonstrate that protein 
synthesis is initiated at the /3-galactosidase AUG. 

Our data, however, do not allow us to conclude directly 
whether the /3-galactosidase protein synthesis initiation site is 
used for ovalbumin expression. The results from polyacrylamide 
gel electrophoresis show that the E. coli ovalbumin has a mo- 
lecular weight of about 43,000, but this value is dependent upon 



standards whose gel migration characteristics have not been 
extensively characterized. It is possible that the 18-amino acid 
tail is synthesized and then cleaved from the molecule by in- 
tracellular proteases. This question will have to be looked at in 
more detail, however, before any firm conclusions may be 
drawn. 

Although 1.5% of the cellular protein is ovalbumin in E. coli 
carrying the pUClOOl plasmid, this level is considerably lower 
than the level theoretically attainable in this system. We have 
determined (unpublished data) that there are between 25 and 
30 copies of the pUClOOl plasmid per chromosome equivalent 
in E. coli. It is known that /3-galactosidase, with one gene copy 
per chromosome equivalent, is expressed as 2% of the total E. 
coli protein in fully induced cultures (23). Therefore, consid- 
ering the gene copy numbers as well as the differences in mo- 
lecular weight between ovalbumin and /3-galactosidase, we 
would expect approximately 19% of the total protein in our 
pUClOOl-containing bacteria to be ovalbumin. In fact, al- 
though a significant amount of ovalbumin was synthesized, it 
was less than 10% of the theoretically predicted amount. 

The most probable explanations for lower-than-theoretical 
levels of ovalbumin in cells carrying the pUClOOl plasmid are 
inefficient transcription, inefficient translation, and proteolytic 
degradation of ovalbumin molecules. 

It has been reported that, when a lac promotor with the UV5 
(CAP-independent) mutation is cloned onto a multicopy plas- 
mid, it functions with full efficiency (24). Thus, it is also likely 
that transcription is fully efficient in the case of pUClOOl. In- 
efficient translation of the ovalbumin mRNA may result from 
either inefficient initiation or inefficient elongation. It is known 
that at least 7 times the normal level of /3-galactosidase can be 
made when the gene is on a multicopy plasmid (25). Therefore, 
it is likely that protein synthesis initiation is not limiting in our 
system. The rate of elongation of mRNA translation will be 
dependent upon many factors, one of which is the availability 
of aminoacyl-tRNAs with anticodons complementary to the 
codons on the mRNA. The sequence data from several proka- 
ryotic and eukaryotic genes, including ovalbumin, have re- 
vealed nonrandom utilization of synonymous codons (16, 26, 
27). Although not enough data have been collected to permit 



AAUUGUGAGCGGA 

10 



UAACAAUUUCACACAGGAAACAGCU 

20 30 



Fmet Thr Met He Thr Asp Ser Leu Glu Phe Pro Lys 

AUGACCAUGAUUACGGAUUCACUGGAAUUCCCGAAA 

40 50 60 70 



Asp Asp Ser Glu Phe Thr Met Gly Ser He Gly 

GACAACUCAGAGUUCACCAUGGGCUCCAUCGGU. 

80 90 



Ova 1 burnt n 



AUG^ 5- Predicted pUClOOl mRNA sequence initiated at the lac promotor and predicted amino acid sequence initiated at the j8-galactosidase 
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any generalizations, it is probable that some of the tRNA iso- 
accepting species required in relatively large amounts for the 
translation of ovalbumin are present in E. coli in low concen- 
trations, thus limiting synthesis. 

The third explanation for lower-than-theoretical ovalbumin 
production is proteolytic degradation. This may occur either 
with nascent polypeptide chains on polysomes or with mature 
ovalbumin molecules, or both. Our self-digestion data with E. 
coli cell extracts suggest that the mature ovalbumin molecules 
are stable to proteolytic digestion in vitro, but our data do not 
extend to the stability of nascent polypeptide chains. 

The finding that ovalbumin molecules synthesized in E. coli 
are secreted into the periplasmic space suggests that common 
features exist in the secretion-recognition mechanisms found 
in chicken oviducts and in E. coli. More experiments will have 
to be done in order to determine what structural features of the 
bacterial ovalbumin cause it to be secreted. The ovalbumin 
naturally secreted from chicken oviducts is a complex protein, 
containing an amino-terminal acetyl group, carbohydrate 
residues, and phosphate residues. It is not clear what role, if any, 
these post-translational modifications play in secretion, and it 
has not been determined whether or not the bacterial ovalbu- 
min is modified. 

Whatever the mechanism for ovalbumin secretion in the hen 
oviduct, it would now appear that some aspects are common 
among living organisms. In this regard it should be noted that 
the secretory "signal" for chicken ovalbumin is probably not 
a hydrophobic amino-terminal sequence (28), as is the case with 
many secretory proteins (29, 30, 31). Thus, there may be ad- 
ditional structural features that are generally recognized for 
secretion. 

Although further work is needed to maximize the synthesis 
of eukaryotic proteins in bacteria, the microbial production of 
significant amounts of chicken ovalbumin reported here 
demonstrates that the E. coli cellular machinery may be utilized 
to synthesize a eukaryotic protein that is stable in the bacterial 
intracellular environment. There is no reason to believe that 
the synthesis of other high molecular weight animal proteins 
will be any more difficult than that of ovalbumin. 

Note Added in Proof. Results similar to those presented here involving 
synthesis of an ovalbumin-like protein by E. coli harboring a recom- 
binant plasmid have recently been reported (32). 

We thank Dr. B. W. O'Malley for pOV230 and for supplying 
pOV230 sequence data prior to publication. We also thank L. Korn 
for his computer program, F. FuUer for supplying the pOP203 plasmid, 
and Dr. V. S. Malik for suggesting it as a source of lac control ele- 
ments. 

1. Itakura, K., Hirose, T., Crea, R., Riggs, A. D., Heyneker, H. L., 
Bolivar, F. & Boyer, H. W. (1977) Science 198, 1056-1063. 

2. Villa-Komaroff, L., Efstratiadis, A., Broome, S., Lomedico, P., 
Tizard, R., Naber, S. P., Chick, W. L. & Gilbert, W. (1978) Proc. 
Natl. Acad. Sci. USA 75, 3727-3731. 

3. Boyer, H. W. & RouUand-Dussoix, D. (1969) Mol. Biol. 41, 
459-472. 



4. Rodriguez, R. L., Bolivar, F., Goodman, H. M., Boyer, H. W. & 
Betlach, M. C. (1976) in Molecular Mechanisms in the Control 
of Gene Expression, eds. Nierlich, D. P., Butter, W. J. & Fox, C. 
F. (Academic, New York), pp. 471-477. 

5. McReynolds, L. A., Catterall, J. F. & O'Malley, B. W. (1977) Gene 
2, 217-231. 

6. Guerry, P., LeBlanc, D. J. & Falkow, S. (1973) J. Bacterial 116, 
1064-1066. 

7. Clewell, D. & Hehnski, D. (1970) Biochemistry 9, 4428-4440. 

8. Wensink, P. C., Finnegan, D. J., Donelson, J. E. & Hogness, D. 
S. (1974) Cell 3, 315-325. 

9. Lowry, O. H., Rosebrough, N. J., Farr, A. L. & Randall, R. J. 
(1951) J. Biol. Chem. 193, 265-275. 

10. Ullrich, A., Shine, J., Chirgwin, J., Pictet, R., Tischer, E., Rutter, 
W. J. & Goodman, H. M. (1977) Science 196, 1313-1319. 

11. Goulian, M., Lucas, Z. J. & Romberg, A. (1968) Biol. Chem. ■ 
243, 627-638. 

12. Sugino, A., Goodman, H. M., Heyneker, H. L., Shine, J., Boyer, 
H. W. & Cozzarelli, N. R. (1977) Biol. Chem. 252, 3987- 
3994. 

13. Laemmli, U. K. (1970) Nature [London) 227, 680-685. 

14. Moir, A. & Brammar, W. J. (1976) Mol. Gen. Genet. 149, 87- 
99. 

15. Skalka, A. & Shapiro, L. (1976) Gene 1, 65-69. 

16. McReynolds, L., O'Malley, B. W., Nisbet, A. D., Fothergill, J. E., 
Givol, D., Fields, S., Robertson, M. & Brownlee, G. G. (1978) 
Nature (London) 273, 723-728. 

17. Maizels, N. M. (1973) Proc. Natl. Acad. Sci. VSA 70, 3585- 
3589. 

18. Dickson, R. C., Abelson, J., Barnes, W. M. & Reznikoff, W. S. 
(1975) Science 187, 27-35. 

19. Spiro, R. G. (1966) Methods Enzymol. 8, 44-49. 

20. Malamy, M. H. & Horecker, B. L. (1964) Biochemistry 3, 
1889-1892. 

21. Lederberg, J. (1950) /. Bacteriol. 60, 381-392. 

22. Torriani, A. (1966) in Procedures in Nucleic Acid Research, eds. 
Cantoni, G. L. & Davies, D. R. (Harper & Row, New York), pp. 
224-235. 

23. Miller, J. H. (1972) in Experiments in Molecular Genetics (Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY), pp. 
398-404. 

24. Backman, K. & Ptashne, M. (1978) Cell 13, 67-71. 

25. O'Farrell, P. H., Polisky, B. & Gelfand, D. H. (1978) /. Bacteriol. 
134, 645-654. 

26. Sanger, F., Air, G. M., Barrell, B. G., Brown, N. L., Coulson, A. 
R., Fiddes, J. C, Hutchison, C. A., Ill, Slocombe, P. M. & Smith, 
M. (1977) Nature (London) 265, 687-695. 

27. Efstratiadis, A., Kafatos, F. C. & Maniatis, T. (1977) Cell 10, 
571-585. 

28. Palmiter, R. D., Gagnon, J. & Walsh, K. A. (1978) Proc. Natl. 
Acad. Sci. USA 75, 94-98. 

29. Habener, J. F., Rosenblatt, M., Kemper, B., Kronenberg, H. M., 
Rich, A. & Potts, J. T., Jr. (1978) Proc. Natl. Acad. Sci. USA 75, 
2616-2620. 

30. Inouye, H. & Beckwith, J. (1978) Proc. Natl. Acad. Sci. USA 74, 
1440-1444. 

31. Blobel, G. & Dobberstein, B. (1975) Cell Biol. 67, 835-851. 

32. Mercereau-Puijalon, O., Royal, A., Cami, B., Garapin, A., Krust, 
A. Gannon, F. & Kourilsky, P. (1978) Nature (London) 275, 
505-510. 



Board Assigned Page #1 803 



TABO 



Board Assigned Page #1 804 




;rodupkx 
EC-5 and 
y were as 
; constant 
n the two 
luced by a 
engths of 
nds of the 



iber 1978 | pfature Vol. 276 21/28 December 1978 

MOPC-149 fragment is shown in Fig. 5b. The fragments are 
homologous for 13.5 kilobases, but also diverge at their 5 end. 
In each case, the divergent heteroduplex bubble represents 
DNA that is different in the embryonic sequence. Evidently 
these new sequences are derived from a reconibination-hke 
event which joined DNA sequences: to the 5' ends of plas- 
macytoma DNA fragments and eliminated sequences present in 
the embryonic fragment (see diagram. Fig. 6). By measuring the 
lengths of the single-stranded DNAs found in each hetero- 
duplex, we can estimate the lengths of the new DNA sequences 
that have been joined to the embryonic EcoK\ fragment. The 
points at which the embryonic fragment differs from each re- 
arranged fragment defines the site at which the recombmation 

event took place. a*™/- i/io 

The recombination events that formed the new MOPC-MQ 
and MOPC-41 fragments involved different sites. In Mt^f *-" 
149 the recombination event appears to have occurred 2 7 
kilobases from the constant region gene and removed the 
MOPC-41-like J region sequence from the embryonic fragment. 
Presumably the MOPC-149 light chain uses a different J region 
sequence. The recombination event that generated the MOPC- 
41 light chain sequence occurred 3 .7 kilobases from the constant 
region sene, very close to the J region sequence found in the 
embryonic fragment. Here a single recombination event may 
have linked the MOPC-41 variable region to the MOPC-41 J 
region to generate the cloned fragment. However until the 
nucleotide seuences of the appropriate regions have been 
determined, we cannot be certain that the recombination event 
that generated the MOPC-41 gene did not actually replace the 
embryonic J region with a MOPC-41 J region at the same 
relative position. It is also possible that both J region sequences 
are present on the MOPC-41 -derived fragment. Nevf heless 
these analyses indicate that the recombmation events that aff ect 
immunoglobulin light chain genes can involve sites at least l.U 
kilobases apart. 

The emerging picture of antibody gene 
arrangement and rearrangement 

By characterising cloned, k chain-encotog se^ents of DNA 
derived from mouse embryo and antibody producmg cells, we 
demonstrate directly that immunoglobuUn constant and van- 
able region genes undergo a somatic rearrangement that brings 
together the segments necessary to encode a complete k light 
chain gene. Moreover, in each light chain producmg plasma- 
cytoma that we have thus far examined there is evidence of 
?mL rearrangementof one setof these genes whikasec^^^^^^^^ 
presumably allelic, set retains the original embryonic pattern. 
Ce observations fit well with the plausible notion that re- 
arraneement of a single allele is instrumental in immunoglobuhn 
gene Sion and fhat it may play a part in allelic exclusion m 
antibody production. 
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In addition we see that, like the mouse A light chain fes 
are encoded in at least three segments in embryonic DNA, a 
foreshortened variable region sequence , a short J sequeiice 
probably encoding the last 10 or so amino acids of the variable 
region and, finally, the constant region sequence. Rearrange- 
ment of these sequences (which may itself contnbute to the 
generation of antibody diversity) forms, in the case of the 
MOPC-41 light chain gene, a new chromosomal sequence m 
which variable and J sequences are adjacent, but separated from 
the constant region gene by a large 3.7-kilobase mtervenmg 
sequence of DNA. In the case of MOPC-149, this rearrange- 
ment has eliminated the embryonic J sequence and replaced it, 
in our clone, with a new segment of DNA. Whether this new 
segment is a portion of the intervening sequence hnking MUPC- 
149 variable and constant regions in genomic DNA is not 
known. In contrast to what might have been expected, these 
rearrangements do not occur at a single site, but the new DNA 
segments are joined at sites about 1.0 kilobases apart on the 5 
side of the constant region gene. The precise nature and the 
molecular basis of this recombination remams unclear, further 
studies should indicate whether or not reciprocal recoinbination 
is involved and whether this recombination is mediated by 
regions of sequence homology. , 

We thank Dr Fred Blattner for the ACH4A phage system, 
Barbara Norman, Marion Nau and Aya Leder for help m the 
gene purification and cloning experiments, Margery Sullivan for 
help with the electron microscopy and Dr Henry I. Miller for 
help with preparative gel electrophoresis. We thank Aya Leder 
for suggesting the gene orientation experiments shown in Fig 
3t, and Terri Broderick for assistance in preparation of this 
manuscript. 
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Synthesis of growth hormone by bacteria 
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R T^rrm- advances in recombinant DNA technology have led to 
fhfSrbacteria of/V^-^S^^^^^^^^^ 

iii\tit:ibrcS^^^^^^ 



A hybrid gene was constructed between the ^-lactamase 
tenLfplasmid pBR322 and the cloned coding sequence 
for ra growth hormone. This gene is expressed m bacteria 
and growth hormone sequences are detectable by 
immunological methods. 
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Hind HI ' 



EcoR I Hind III 




r.g. 1 construction of a vector for bactenal hes^^ ^» 
s^auences The cloned RGH cDNA was cleaved from plasmid 
nRGHTfref 2) with KmdIII endonuclease and isolated by poly- 
SSL df gel electrophoresis. This DNA was enzymafcaU^ 
S^d lo similarly cJved and ^^^^^^ ^^^^^1^-':^':^^ 
pMB9 The ligation mixture was used to transform £. ^o™'" 
^mSMsl^B) colonies a>t^^^^^^^^ 

rtZ RfiH seauences by gel analysis of such plasmids atter 
Son^"S eicle^e m a^^^^^^^^ 

il,„Ms RfiH DNA was found to have its poly dA fli ena 
?rren4?ed to "ards the single BcoRI site in pMB9. Such a plasm.d 
(nS RGW used in the subsequent steps for vector con- 
Sn^?hepre.RGHse,ue„cesfromp^^^^^^^^ 

fe?5n?bl fo ba terial tetracycline resistance'. The amp and tet 
;iEpBR322wassimilarlydigested with these two en^mesJ^^^ 



Table 1 Radioimmunoassay of RGH m extracts from minicelb containing 
pEx-RGH or pBR322 



Plasmid 

pEjt-RGH 
pBR322 



RGH by radioimmunoassay (ng ml 

Periplasmic space 
30.8 (30,0, 31.6) 
Undetectable 



SpheroplastS 
89.1 (70-8-116.6) 
Undetectable 



One litre oE B coU P64g-54 containing either pEx-RGH ov pBp72 was grown 
.0° mr^on afd »lleld by cantrifugation at lO.OOOg for 10 mm^The «Ik 
nrrspheroplasts were prepared by modifi^tion o P-^"-/ ^'^ 
fl«whcre^* Briefly, cells were washed once in 200 ml of lOmMins ipn lAj 
'cLnS 1 nM PMSF Cph-V'™e«»>y'*«vl Auonde S^^^T^^^^^ 
rKusoended in 50 ml of 20% sucrose, 0.033 Tns (pH 8.0 ana imM rmsr. 
SS o 1 M EDTA (pH 8.0) was added (0.5 ml) and the matures were kept on 
S f™ 5 ml -4er ad^tion of 1 m! of 5 mg 

were incubated for 30 min at 0°C and centnfuged (17 OOOg. 10 mm 0 C)^ The 
fmS M^ClS /aStoafinalcon'^^^ 

rf DNase (1 mgml-') and RNase (S mgrnr^). Incubation proceeded for 1 h at 
:=?Seropl^t sample). These mixtures UOml) and 50ml of he s^^^^^^ 
contaitring periplasmic material were P7'P;'f/„:"*'SH 9 0) 0 MNaQ 
tion). The samples were resuspended in 4 ml of 50 {"^'f " ^'^4 Seri» 
Za \ mM PMSF and dialysed against 2 1 of the same buffer for 2 h at 4 c. senai 
t ons ot*fhL sampled werl analysed by J-?tTht^, 
. ,19 A .,„tc of HfJH found bv this assay are listed in ng ml wiinine 

'm'ra^drang^'lSS^^ 

rdTnSvXLforduplicate samples from the periplasmicspacesam^^ 
activity was detected in samples from pBR322. The sensitivty — ™ 

cells during spheroplast formation. 



growth hormone (RGH) sequences in bacteria, is show" in Ftg^ 
1 The bacterial sequences for this gene were excised from the 
Lw^prpTasmid PBR322 (ref. 9). They ^ "^J^^^f" 
Se gene (Lp') coding for the enzyme ^-lactamase which is 
e rS into'the perfplasmic space is r-po^f ^ J- 
bacterial resistance to the antibiotic ampicilUn . The man^^ 
maUan sequences were provided by cloned complementary 
SNrcontlLngtheentirecoding region for the precursor^^^^ 

of rat pUuitary Gtf , Functional fusion of the two donor DN^ 
i'affrcilitatSd by Icnowledge of their P^^-^^^^ ^^^^i^,;, 
Both DNAs possess a unique cleavage site for the 
endonucleaseVl at which they can be joined ^o^^-^l'^^^ 
correct reading frame (Fig, 2). In the amp gene the P sr^^ 
situated approximately two-thirds towards the ^n^^ Kun 
cDNAtheoorrespondingsiteoccursafewbasepairsdistaltothe 
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TRANSLATION- 



PREGROWTH HORMONE 



p- LACTAMASE 



-23 
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-21 



economiSf fS.d production. The species specificity and size 
(S aS acids) of GH- prevents us from exploitingexisting 
risources or applying current synthetic techniques. Thus, for 
GH and sS/fo? a variety of other ^-^-ly^'^.^^'^^, 
genetically programmed bacteria may provide an important 

resource. 

Gene construction 

rru^ .^„=tn,rtinn of a hybrid gene between bacterial and mam- 
X DNr^e^lces, caplble of directing synthesis of rat 
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AMP\ /RGH 



Pst t 



EcoRI 



Hind 111 EcoRI Ps*l "'"^^ 

EXPRESSION PIASMID 
Re. 2 Postulated nucleotide sequence around the ^^'^ ' " ene 
hybrti gene. The Pstl site is at the fusion between th. «mP S 

JequenL- and the cloned RGH cDNA^ ^^^^^J^^^^^^^r^io 
oaminoacidresiduesofpre-^-lactamaseandnegativenum 

those of the RGH pre-sequence. 



Fig 



3 Proteins 
pBR322. The mir 
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methods^. A culti 
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resuspended in 2 
medium. After 3 
methionine and 
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initiator codon for the pre-peptide. The resultant new gene 
Ihould code for a chimaeric protein of 395 ammo acids (molecu- 
HeS ~ 44,000) containing the N-terniinal l81 ammo a^^^^^ 
of the l-lactamase precursor covalently linked to 214 residues 
of rat pre-GH, As this fusion protein carries the signal peptide ot 
8 -lactamase, it seems possible that it might be transported to the 
periplasmic space. Such an extracellular location might mmi- 
n^^^o^ mise possible protease attack and facilitate isolation. 

rhe cells were 
res described 

Tris(pH7.4) a b c 

"he cells were 

1 mM PMSF. 
were kept on 
) the mixtures 
in.O'O.Ttie 
iai from the 
(pHS.O) and 
ti 200 fii each 
ded for 1 h at 

' the samples ^ | 

(50% satura- 
i),0.1MNaCl 
at 1°C. Serial 
as described 
nir' with the 
and (he mean 
sample. No 
nimutn^of the 
I ng ml . The 
> lysis of some 



own in Fig. 
;d from the 
1 portion of 
se which is 
onsible tor 
The mam- 
plementary 
cursor form 

3norDN^ Fig. 3 Proteins synthesised from Pl^;"'d%PEx-RGH and 
ructures"-'". "mi^ll The minicell-producing E. cob strain P678-54 Uef- 
: restriction ^^""^transto^ed with pEx-RGH or pBR322 by standard 

distaltothe ruiSt^^'ucine-andtnet^^^^^^^^^^^^^^ 

reS"wh ch"were then vigorously shaken for 30 tnin at 
^7 ?C Sr a 30-min chase period with 400 ^.g of nonradioactive 
LetSo^Se and leucine, the minicdis were pelleted by cento- 
ruSonat8.000gfor2minandha^^ 

were Ivsed in 200 )il of SDS sample buffer . The second halt oi 
r^ch Seen preparation was used to isolate the P"tems secreted 

?.ophoresedonalO%po^yacr^^^^^^ 

a^e . reflects the proteins secreted into the penplasmic space of 
SfcdirSSng PBR322. Arrow 1 ref e. lo the J^a^.on of the 
fusion protein and arrow 2 indicates ^-lactamase. 
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EXPRESSION* PBR322 .'^^Si^'- ' 



PLASMID 



To enable selection of plasmids carrying the correct hybrid 
Je the cloned GH cDNA (originally ^^f P^^^^^ 
DBR322) was first transferred from the HmdIII site of pBR322 
nto the homologous site of pMB9. a closely related p asm.d 
;S^ilLkXamp^gene.Intheresultantrecombmar.tplasim^ 
RGHUhe gene for tetracycline resistance is expressed 
SSels-'one such plasmid wh^J. — GH 
cDN A inserted in the orientation shown in Fig, 1 was used as the 
SiSXmp' gene sequences. Replacement of the unique 



Kfi 4 Immunological detection of RGH sequences m bactena. 
Bacterial colonies containing the constructed expression vector 
nEx RGH were assayed for the presence of RGH sequences usmg 
^ modification'^ of the solid-phase immunological screening 

LTod e :nd Gilbert- f -f/MTirtoa^M 
replaced with phosphate-buffered salme flMNaCl. 0,025 M 

potassium phosphate. pH 7.4 f J '"jl 

NP40 (0 1%) Bacterial colonies of E. cob strain P 678-54 (ref. 13) 
carryingpEx-RGH or pBR322 were grown ™ L-broth agar m 
Petri dishes. A photograph of such a plate is shown on the right. 

treatment (20 min) with chloroform vapour the cok>ni« 
were exposed to polyvinyl strips coated with antiserum to RGR 
Str PS were then soaked in the modified wash buifer (see above to 
minimise binding of IgG in the subsequent reaction. The strips 
~enLubaldovVnightat4°Cwith affinity a,u«^ 
«n-anti-RGH IgG (lO' c.p-m., specific activity -5.3 p-Ci per M-g), 
washS and exposed to X-ray film. An autoradiogram is shown on 
the left. 



Pstl-Bamm fragment in pMB9-RGH with that from pBR322 
should place the coding sequences for rat GH under control o 
IhZmp' gene and will at the same time restore a fully functional 
5 g^ne (see Fig. D- Thus, this fragment replacement enabled 
ele'ion of bacterial colonies carrying the -P-^^'°" 
'pEx-RGH' by their abiUty to grow on high levels of tetra- 
cycline. 

Gene expression 

Purified minicells" were used for the study of proteins encoded 
brthe exTes^ion vector pEx-RGH. Proteins were analysed by 
SDS polvacrylamide gel electrophoresis and compared with 
thoL enSdby PBR322 (Fig. 3) andpMB9 (datanotshowa).^ 
Sipated. Z two polypeptides P--^-lactamase and^- 
lactamase which are synthesised m cells carrying pBR322 are 
„5 made from pEx-RGH. The expression vector codes 
Squely for a polypeptide (Fig. 3, lane b, arrow 1) h an 
:;;Trent moIecSlar weight (46,000) -^^in error hm^^^^^^^^^^^ 
exoected for the hybrid protein. A comparison of band mtensi 
S (corrected for cell number) from several experiments similar 
;^SsSbedinFig.3suggeststh^ 

oroduced in substantial quantities from pEx-RGH, alttiougfi in 
Ke? amounts (about one-fifth) than ^-lactamase from 
pBR32rWhen Voteins released aft- the formatio^^^^^^^ 

small amounts in the periplasmic space if at all. 

Immunological detection of growth hormone 
svnthesised by bacteria 

Escherichia coli colonies containing pEx-RGH were screened 
fo 'SfcXt of RGH using a modification" ^ 
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immunological method^ (Fig. 4). Briefly, colonies were lysed ^ 
^ -^k rwri and anv RGH sequences present in the cells 

antisemm to ROH raised in monkeys (see ref. 16 for »" 

. ^„„««hv A<i shown in Fig. 4, colonies containing the 
autoradiography, as snown m rig , anti-RGH-IeG 
exoression plasmid were labelled with the 1-anti k^jH ig" 
nrote whereas those containing pBR322 failed to do so. 
' sS;ra^ cLn" ols demonstrated the '^^^ISS 
of the assay for RGH and strengthened the cone usion that the 
clones are producing pituitary GH sequences (Fig. 5). First 
clones are P™au & v j ^^^^^^^^j ^^^^ not 

SefoConies containing pEx-RGH was prevented when a 
SfexceTs (250-fold) of nonradioactive GH was incubated 
w th raSt ve probe. Thirdjabelling was not ^ 
:l:imo".eysLumorcalfseru.^^^^^^ 
in place of anti-RGH antiserum. Finally, I-labeliea norm 
monkey serum failed to label any colony from the strams 
described above (not shown). 



ADDITIONS 



SENSITIVITY 



PLASMID 



ANHSIBOWTOOH _ , 

U^BB^GH SERUM ■ 




Fig. 5 immunological detection of GH ^eq^s synt^^^^^^^^^ 
were treated as described m the legena lo r ig. ,i , 

place 0 RGH „*(rGH) sticks to plastic even m 

contammg P^^f^^^^ '^'f "Ae sensitivity of the assay was 
the absence of anti-KUtl igvj- ""^ V.*-h no 0 8 na 

above. 



obJemd in similar isolates from bacteria containing pBR322. 

Discussion 

tn the oresent studies, bacteria have been genetically pro- 
Sa^'L'dTSnUe mammalian pitu.^ 
Equences. Further analysis is required to assess the fidelity 



.uch expression, the stability of the protein products and their 
teloS ac ivi y. The absolute quantity of GH synthesised by 
S Kria cannot be determined accurately from the current 
studies (an estimate of 24.000 molecules per cell .s oto^d 
om the solid-phase immunoassay). However. ]udgmg from the 
prom nence of the band in Fig. 3, it seems that sequen^s o^ 
RGH can he a major plasmid product. The quantity of GH 
preSnt ^ less than that of ^-lactamase; however, the study 
shown in Fig. 3 may result in an underestimate of the amount of 
synthesised, because in more recent studies (not shown), 
SLS of ow;r molecular weight (in addition to the l^ger 
mo ecSar weight material) is specifically immunoprecipitated 
W anSemm to RGH but not by antiserum to bovine serum 
lumin This material could consistof degradation products or 

the products of premature termination. 

The amp' gene was used for expressing RGH coding 
sequencesiiththehopethatthemammaUanhormonew^^^^^^^^ 

secreted by the bacteria. Such secretion might facilitate the 
solTrnofthe protein product andalso protect the protemfro^^ 

delX^n by'proteas'es. The absence of GH sequences m the 
periplasmic space is surprising m vie* of the ^at Te for 
Kcmaroff et a! have recently constructed a gene suitable tor 
Kl^nofratinsulin through linkage with^^^^^^ 
and found that the protein product was ^^^^^^^ed Jt is pos b^^ 
that the hydrophobic region corresponding to the pre-part ot 
RGH in the middle of the hybrid protein prevents the kmd of 
a,nformation required for transport through membranes 
MarTy, the overall structure of the fusion polypeptide migh 
orTven ^rrect processing of the precursor sequence. More 
TaSsTudiessLldhelp determine whichfa^^^^^^^^^^ 
pre-sequence of the signal peptide protein of the hormone, are 

"XrSSTilS^^ 

coding sequences from higher organisms can be expressed n 
Scleria Thus, it should be possible to produce b.ologicalj, 
imoortant peptides with the use of recombmant DNA tecti 
nXs and Lturally occurring structural gene sequences^ 
Safly Se capability of expression of genes that are designed 
SecS or synthesised- should make pos^^ble the produc- 
tion of new hormone analogues for examinmg the struc ural 
requirements for agonist or antagonist -ity «r fo^^^^^^^^^ 
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ABSTRACT Synthetic genes for human insulin A and B 
chains were cloned separately in plasmid pBR322. The cloned 
synthetic genes were then fused to an Escherichia coli /3-ga- 
lactosidase gene to provide efficient transcription and transla- 
tion and a stable precursor protein. The insulin peptides were 
cleaved from |3-galactosidase, detected by radioimmunoassay, 
and purified. Complete purification of the A chain and partial 
purification of the B chain were achieved. These products were 
mixed, reduced, and reoxidized. The presence of insulin was 
detected by radioimmunoassay. 

Recently improved methods of DNA chemical synthesis, 
combined with recombinant DNA technology, permit the de- 
sign and relatively rapid synthesis of modest-sized genes that 
can be incorporated into prokaryotic cells for gene expression. 
The feasibility of this general approach was first demonstrated 
by the synthesis, and expression in Escherichia coli, of a gene 
for the mammalian peptide somatostatin (1). 

Following the precursor protein approach used for soma- 
tostatin (1), the experimental design for this work was such that 
the insuhn peptide chains would be made in vivo as short tails 
joined by a methionine to the end of /3-galactosidase. After 
synthesis, the insulin chains, which contain no methionine, can 
be cleaved off efficiently by treatment with cyanogen bromide. 
We deliberately chose to construct two separate bacterial 
strains, one for each of the two peptide chains of insulin: the 
21-amino-acid A chain and the 30-amino-acid B chain. In na- 
tive insulin, the two chains are held together by two disulfide 
bonds, and methods have been available for years for joining 
the chains correctly, in vitro, by air oxidation (2). The efficiency 
of correct joining has been variable and often low. However, 
by using S-sulfonated derivatives and an excess of A chain, 
50-80% correct joining has been obtained (3). 

The synthetic plan and chemical synthesis of the DNA 
fragments coding for the A and B chains of human insulin were 
described in a previous paper (4) and were summarized in Fig. 
1 of that paper. In this communication, we describe the as- 
sembly and cloning of the genes for the A and B chains, their 
insertion into the carboxy terminus of the E. coli iS-galactosidase 
structiu-al gene, the expression and purification of the separate 
A and B chains, and their joining to form native human insu- 
lin. 

MATERIALS AND METHODS 

Bacterial Strains. E. coli K-12 strain 294 {endA, thi~, hsr~, 
hsmi^} (5) was provided by K. Backman. E. coli K-12 strain 
DI210, a lac+ (iQo+2 + y+) derivative of HBlOl, was con- 
structed by J. Betz and J. Sadler and obtained from J. Sadler. 



The publication costs of this article were defrayed in part by page 
charge payment. This article must therefore be hereby marked "ad- 
vertisement" in accordance with 18 U. S. C. §1734 solely to indicate 
this fact. 



Enzymes and DNA Preparations. T4 DNA ligase and T4 
polynucleotide kinase were purified as described (6). Restriction 
endonuclease £co Rl was purified by the procedure of Greene 
et al. (7); Hindlll was purified by a method developed by D. 
Goeddel (unpublished). Restriction endonuclease Bam HI was 
purchased from Bethesda Research (Rockville, MD); E. coli 
alkahne phosphatase was purchased from Worthington. 

Plasmids, including pBR322 (8), were isolated by a published 
procedure (9) with some modifications. The chemical synthesis 
of the deoxyoligonucleotides (figure 1 of ref. 4) has been de- 
scribed (4). Xp/ac5 DNA was isolated as described (10). 

The following reaction buffers were used: kinase buffer, 60 
mM Tris-HCl, pH 8/15 mM 2-mercaptoethanol/lO mM MgClz; 
ligase buffer, 20 mM Tris-HCl, pH 7.5/10 mM dithiothreitol/10 
mM MgCl2; BamHI buffer, 20 mM Tris-HCl, pH 7.5/7 mM 
MgCl2/2 mM 2-mercaptoethanol; EcoRI-Hfndlll buffer, 
BamHI buffer containing 50 mM NaCl; and phosphatase 
buffer, 50 mM Tris-HCl, pH 8/10 mM MgClj. 

Assembly of Insulin Genes. The assembly of the right (BB) 
half of the B-chain gene (see figure 1 of ref. 4) will be described 
in detail. Oligonucleotides B2-B9 were phosphorylated indi- 
vidually. Fifty microcuries of [7-32p]ATP («2000 Ci/mmol, 
New England Nuclear) was evaporated to dryness in a 1.5-ml 
polypropylene tube, then incubated with the oligonucleotide 
( 10 /tg) and 8 units of T4 polynucleotide kinase in 60 /tl of kinase 
buffer. After 20 min at 37°C, 10 nmol of ATP and 10 units of 
T4 kinase were added and the reaction was continued for an 
additional hour. The kinase was inactivated by heating at 90° C 
for 5 min. 

Phosphorylated fragments B2, B3, B6, and B7 (2.5 ng each) 
were combined with 2.5 /tg of 5'-OH fragment Bl and dialyzed 
for 2 hr against 1 hter of hgase buffer. ATP was added to a 
concentration of 0.2 mM, the reaction mixture (60 /tl) was 
cooled to 12°C, and T4 ligase (50 units) was added. A separate 
ligation reaction involving phosphorylated fragments B4, B5, 
B8, and B9 and unphosphorylated BIO was performed identi- 
cally. After 12 hr at 12°C, the two ligation reaction mixtures 
were combined, additional T4 ligase (40 units) was added, and 
the mixture was incubated at 12°C for 4 hr. The mixture was 
extracted with phenol/chloroform and precipitated with eth- 
anol, and the DNA fragments were purified by electrophoresis 
on a 10% acrylamide gel (11). The most slowly migrating band 
was sliced from the gel and the DNA was extracted (11). 

A similar procedure, with the following exceptions, was used 
to assemble the left (BH) half of the B-chain gene. All eight 



Abbreviations: BB, left half of insulin B gene; BH, right half of insulin 
B gene; A(SS03~), S-sulfonated derivatives of the insuUn A-chain 
peptide; B(SS03~), S-sulfonated derivatives of the insulin B-chain 
peptide. 

^ To whom reprint requests should be addressed. 
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oligonucleotides (H1-H8 in figure 1 of ref. 4, 30 fig each) were 
phosphorylated. Therefore, after complete hgation and before 
purification by gel electrophoresis, the reaction mixture was 
treated with 400 units of EcoRI and 400 units of Hindlll for 
2 hr at 37°C. The BH band migrating at 46 base pairs was 
eluted from a 10% acrylamide gel. 

The procedure used to construct the A-chain gene was also 
similar to that described for the BE fragment. The only major 
difference was that, after assembly, the 5' ends of the complete 
A gene were phosphorylated. 

Construction and Characterization of /ac-Insulin Hybrid 
Plasmids. The BB fragment was cloned as follows: 1 iig of 
pBR322 was treated with 5 units of BamHl in BamH\ buffer 
for 1 hr at 37° C. After addition of NaCl to 50 mM, Hindlll (5 
units) was added and the reaction was continued for 1 hr. The 
enzymes were inactivated by heating at 70° C for 10 min. The 
prepared pBR322 was hgated to the BB fragment for 3 hr at 
12°C in 25 n\ of ligase buffer (containing 0.16 mM ATP) by 
using 20 units of T4 ligase. Half of the resulting DNA was used 
to transform E. coli 294 by a published procedure (12). The BH 
fragment and the A-chain gene were cloned similarly, with the 
appropriate restriction endonucleases to cut pBR322. 

Construction of the plasmids for expression of the synthetic 
insulin genes is described in the legend of Fig. 1. The separate 
chains in insulin are biologically inactive (2) and were synthe- 
sized attached to large precursor proteins. Therefore, the con- 
tainment level of P2-EK1, recommended by the National In- 
stitutes of Health guideline, was used. 

DNA Sequences. The method of Maxam and Gilbert (11) 
was used to determine DNA sequences. Sequence data are not 
included, but will be provided upon request. 

Preparation of Insulin Reagents. Porcine and bovine insulin 
were purchased from Sigma. The S-sulfonated derivatives 
(SSOs") of their A and B chains were prepared and purified as 
described (13). ^ss-Labeled A(SS03-) and B(SS03-) were 
prepared similarly except that 5 mCi of sodium ['^jsulfite (69 
mCi/mol, New England Nuclear) was substituted for unlabeled 



sodium sulfite. After separation of the chains, the specific ac- 
tivity was 92,000 cpm/jig and 32,000 cpm/^ig, respectively, 
for A and B chains. The radioimmunoassay for the insulin 
chains is described in the legend of Fig. 3. 

Purification of B Chain of Human Insulin. E. coli 
D1210/pIBl was grown to late logarithmic phase in 7 liters of 
LB medium (10) containing 20 mg of ampicillin per liter. Iso- 
propyl-/S-D-thiogalactoside was added to a final concentration 
of 2 mM, and the cells were grown for one more doubling. Wet 
cell paste (24 g) was suspended in 30 ml of BB buffer (10) and 
cells were lysed by one passage through a French press al 4000 
lb/inch^ (27.6 MPa). The cell debris was pelleted by centrifu- 
gation at 15,000 rpm for 30 min. The pellet was dissolved in 40 
ml of 6 M guanidinium chloride/1% 2-mercaptoethanol and 
centrifuged at 30,000 rpm for 1 hr. The supernatant was di- 
alyzed overnight against 20 Uters of H2O. The precipitate, 
containing about 1 g of protein, was dissolved in 25 ml of 70% 
formic acid. Cyanogen bromide (1.3 g) was added and the 
mixture was allowed to react overnight at room temperature. 
Formic acid and cyanogen bromide was removed by rotary 
evaporation and the residue was dissolved in 50 ml of 8 M 
guanidinium chloride. S-Sulfonated derivates of the peptide 
mixture were prepared by adding 1 g of sodium tetrathionate 
and 2 g of sodium sulfite, adjusting the pH to 9 with NH4OH, 
and stirring the mixtures at room temperature for 24 hr. The 
pH was then adjusted to 5 with acetic acid and the mixture was 
dialyzed twice against 3 liters of H2O. The resulting white 
precipitate («0.6 g of protein) was pelleted by centrifuging at 
10,000 rpm for 10 min. 

Purification of A Chain of Human Insulin. E. coli 294/ 
pIAl was grown to A550 of 2 in 5 liters of LB medium con- 
taining 20 mg of ampicillin per liter. This strain is constitutive 
for /3-galactosidase and so was not induced. The 15 g (wet 
weight) of cells obtained were processed by the same procedure 
used for the B chain up through the preparation of the S-sul- 
fonated derivatives. After the pH was adjusted to 5 and the 
mixture was dialyzed against H2O to an ionic strength of about 
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Fig. 1. Construction of /ac-insulin plasmids. pBBlOl (2 >jg) (pBR322 containing tlie BB sequence) was treated with EcoRI and HmdIII (20 
units each), and the large fragment was purified oh a 10% acrylamide gel. pBHl (8 Mg) (pBR322 containing BH sequence) also was treated with 
EcoRI and HmdIII, and the small fragment was purified on a 10% acrylamide gel. These twp fragments were ligated to 2 ng of £co Rl-digested 
\plac5 in 30 id of ligase buffer with 20 units of ligase. This mixture was used to transform E.coli 294. The configuration of restriction site ends 
(V represents Hindlll; • represents EcoRI) was such that only correct joining of the two halves of the B gene would lead to viable plasmids. 
To screen for the presence of the lac fragment, we plated the transformed culture on glucose minimal plates (10) containing 40 ng of 5-bromo- 
4-chloro-3-indoyl-/3-gakctoside (X-gal) and 20 jig of ampicillin per ml. Plasmids were prepared from /3-galactosidase constitutive (blue) colonies. 
Because the XplacS lac operon fragment contains an asymmetrical Hiradlll site (14), the orientation of that fragment in the resulting plasmids 
can be determined. Plasmid samples of 1 Mg were digested with HmdIII and sized on 0.7% agarose gels. Plasmids (15-Mg samples) having the 
desired orientation of the lac fragment were then treated with EcoRI, HmdIII, and BamHl, and sized on a 10% acrylamide gel to verify the 
presence of both the BH and BB fragments. The diagram of pIBl (7.1 megadaltons) is not drawn to scale. To construct the /ac-insulin A plasmid 
(pIAl, not shown), we ligated 1 Mg of EcoRI-treated pAll (pBR322 containing the A gene) and 3 Mg of EcoRI-treated XplacS for 4 hr at 4°C. 
Transformants of E. coli 294 were selected for resistance to ampicillin on X-gal plates. Orientation of the lac fragment was determined by digesting 
plasmids purified from the blue colonies with Hindlll and BamHl. 
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Fig. 2. Sodium dodecyl sulfate/polyacrylamide gel electropho- 
resis of extracts of strain 294/pIAl. Samples were heated in O'Farrell 
sample buffer and electrophoresed in a sodium dodecyl sulfate/10% 
gel as described (15). Lanes A, B, and C: total cells; 20, 10, and 5 id, 
respectively. Lanes D, E, and F: insoluble cell debris; 20, 10, and 5 ^1, 
respectively. 

0.01 M, the mixture was centrifuged and the supernatant was 
used for further purification (see Results and Discussion). 

RESULTS AND DISCUSSION 

Assembly and Cloning of B-Chain Gene and A-Chain 
Gene. The gene for the B chain of insuhn was designed to have 
an EcoRI restriction site on the left end, a HindlU site in the 
middle, and a BamHl site at the right end. This was done so that 
both halves, the left EcoRI-Jfindlll half (BH) and the right 
Hindlll-BamHI half (BB), could be separately cloned in the 
convenient cloning vehicle pBR322 (8) and, after their se- 
quences had been verified, joined to give the complete B gene 
(Fig. 1). The BB half was assembled by ligation from 10 oligo- 
deoxyribonucleotides, labeled Bl-BlO in figure 1 of ref. 4, 
made by phosphotriester chemical synthesis. Bl and BIO were 
not phosphorylated, thereby eliminating unwanted polymer- 
ization of these fragments tlwough their cohesive ends (Hindlll 
and BamHl). After purification by preparative acrylamide gel 
electrophoresis and elution of the largest DNA band, the BB 
fragment was inserted into plasmid pBR322 that had been 
cleaved with Hindlll and BamHl. About 50% of the ampicil- 
Hn-resistant colonies derived from the DNA were sensitive to 
tetracyline, indicating that a nonplasmid Hindlll-BamHI 
fragment had been inserted. The sequences of the small Hin- 
dlll-BamHl fragments from four of these colonies (pBBlOl 
to pBB104) were determined (11) and were correct as de- 
signed. 

The BH fragment was prepared in a similar manner and 
inserted into pBR322 that had been cleaved with EeoKl and 
Hindlll restriction endonucleases. Plasmids from three am- 
picillin-resistant, tetracycline-sensitive transformants (pBHl 
to pBH3) were analyzed. The small EcoRI-Hindlll fragments 
had the expected nucleotide sequence. 

The A-chain gene was assembled in three parts. The left four, 
middle four, and right four oligonucleotides (see figure 1 of ref. 
4) were ligated separately, then mixed and ligated (oligonu- 
cleotides Al and A12 were unphosphorylated). The assembled 
A-chain gene was phosphorylated, purified by gel electro- 
phoresis, and cloned in pBR3^ at the EcoRI-BamHI sites. The 
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Fig. 3. Reconstitution radioimmunoassay for insulin chains. The 
S-sulfonated A sample was mixed with the S-sulfonated B sample 
in a 1.5-ml conical polypropylene tube and dried under reduced 
pressure. The dried proteins were suspended in 25 id of 10 mM sodium 
acetate (pH 4.5). Two microliters of 10% (vol/vol) 2-mercaptoethanol 
was added and the mixture was heated (»95°C) for 10 min. The 
mercaptoethanol was removed by ethyl acetate extraction. Two mi- 
croliters of 0.1 M glycine buffer (pH 10.6) was added, and the pH was 
adjusted, if necessary, to 9.6-10.6. The open tube was placed in a 
dessicator over moist hyamine hydroxide at room temperature for >6 
hr. A diluted aliquot of the reaction mixture was assayed for insulin 
radioimmune activity by use of a commercially available radioim- 
mimoassay kit (Phadebus Insulin Test, Pharmacia). Dilution and 
assay were done according to the instructions supplied. The ordinate 
B/Bo is the cpm in the pellet of the experimental sample divided by 
the cpm in the pellet obtained with buffer only. •, 40 iig of porcine 
ACSSOa") in 10 mM NH4HCO3 (pH 9) was mixed with 10 /.g of bovine 
B(SS03~) in the same buffer; O, porcine A(SS03") only; □, bovine 
B(SS03-) only; A,mng of porcine ACSSOs-) and 93 ng of E. coli 
B-chain fraction F-10 (cleaved by CNBr and S-sulfonated, insoluble 
at pH 5); X, fraction F-10 only; A, 100 itg of porcine A(SS03-), 93 ng 
of fraction F-10, and 3 iig of bovine B(SS03-). 

EcoRI-BamHI fragments from two ampicillin-resistant, tet- 
racycline-sensitive clones (pAlO and pAll) contained the de- 
sired A-gene sequence. 

Construction of Plasmids for Expression of A and B Insulin 
Genes. Fig. 1 illustrates the construction of the fcic-insuhn B 
plasmid (pIBl). Plasmids pBHl and pBBlOl were digested with 
EcoRI and Hindlll endonucleases. The small BH fragment of 
pBHl and the large fragment of pBBlOl (containing the BB 
fragment and most of pBR322) were purified by gel electro- 
phoresis, mixed, and ligated in the presence of EcoRI-cleaved 
Xplac5. The 4.4-megadalton EcoRI fragment of \plac5 con- 
tains the lac control region and the majority of the ;8-galacto- 
sidase structural gene (1, 14). The configuration of the restric- 
tion sites ensures correct joining of BH to BB. The lae EcoRI 
fragment can be inserted in two orientations; thus, only half of 
the clones obtained after transformation should have the desired 
orientation. The orientation of 10 ampicillin-resistant, /3-ga- 
lactosidase-constitutive clones were checked by restriction 
analysis (see legend of Fig. 1). Five of these colonies contained 
the entire B-gene sequence and the correct reading frame from 
the ^-galactosidase gene into the B-chain gene. One, pIBl, was 
chosen for subsequent experiments. 

In a similar experiment, the 4.4-megadalton lac fragment 
from \plac5 was introduced into the pAl 1 plasmid at the EcoRI 
site to give pIAl. pIAl is identical to pIBl except that the A- 
gene fragment is substituted for the B-gene fragment. DNA 
sequence analysis showed that the correct A- and B-chain gene 
sequences were retained in pIAl and pIBl, respectively. 



Board Assigned Page #1812 



Biochemistry: Goeddel et al. 



Proc. Natl. Acad. Sci. USA 76 (1979) 



109 




Fig. 4. DEAE-cellulose chromatography of an extract of strain 
D1210/pIBl. The fraction that was insoluble at pH 5 (580 mg of 
protein) was dissolved in 20 ml of 10 mM NH4HCO3 and adjusted to 
pH 9. ssS-Labeled bovine BCSSOs") (135,000 cpm; 4.4 ixg) was added 
and the sample was applied to a 2 X 60 cm column of Whatman DE52. 
Elution was with a 1-liter gradient of 0.01-2.0 M NH4HCO3 (pH 9). 
Fractions of 4 ml were collected. •, j428o; 0, 100-^1 aliquots were used 
to measure radioactivity of '•''S-labeled bovine B(SS03~); A, lOO-jtl 
aliquots were assayed for B-chain radioimmune activity by being 
mixed with 100 »jg of porcine A(SS03"") and using the reconstitution 
assay (Fig. 3). 



Expression. The strains that contain the insulin genes cor- 
rectly attached to |8-galactosidase (D1210/pIBl and 294/pIAl) 
both produce large quantities of a protein the size of /3-galac- 
tosidase (Fig. 2). Approximately 20% of the total cellular protein 
was this ^-galactosidase-insulin A or B chain hybrid. The hybrid 
proteins are insoluble and were found in the first low-speed 
pellet in which they constitute «=50% of the protein (Fig. 2). 

To detect the expression of the insulin A and B chains, we 

Table 1. Amino acid composition of E. coli insulin A chain 



Amino 
acid 




Residues per peptide 




E. coli A(SS03-) 


Porcine A(SS03-) 


Predicted 


His 


0.08 




0.08 


0 


Lys 


0.00 




0.00 


0 


Trp 


0.00 




0.00 


0 


Arg 


0.00 




0.00 


0 


Phe 


0.00 




0.00 


0 


Asx 


2.38 




2.50 


2 


Thr 


0.24 




0.28 


1 


Ser 


0.14 




0.23 


2 


H-Ser 


0.02 




0.00 


0 


Glx 


3.97 




4.58 


4 


Pro 


0.00 




0.09 


0 


Gly 


1.40 




1.48 


1 


Ala 


0.20 




0.11 


0 


Cys 


0.55 




0.00 


0 


Val 


1.15 




1.06 


1 


Met 


0.62 




0.43 


0 


He 


1.99 




1.48 


2 


Leu 


2.33 




2.35 


2 


Tyr 


1.89 




2.30 


2 



Approximately 25 iig of porcine A(SS03~) (which is identical in 
sequence to human A) and 25 ng of E. coli A(SS03~) purified twice 
by high-performance liquid chromatography were hydrolyzed and 
analyzed in parallel. The SS03~ derivatives of cysteine were destroyed 
during hydrolysis and do not register as amino acids with the program 
used. Serine and threonine were also partially destroyed. 




8 



20 



12 16 

Time, min 

Fig. 5. Reversed-phase high-performance liquid chromatogra- 
phy. {A) A sample (500 iig), purified by DEAE-cellulose chromatog- 
raphy (B chain. Fig. 4), was subjected to high-performance liquid 
chromatography at room temperature on an SP 3500 liquid chro- 
matograph (Spectra-Physics) equipped with a LiChro<!orb RP-8 0.3 
X 25 cm column (Merck EM). The eluting buffer was 50 mM NH4OAC 
with an acetonitrile gradient of 24-60%. Fractions were collected and 
dried, and samples were assayed for B-chain radioimmune activity 
by our reconstitution assay. (B) An A-chain sample (500 ng total 
protein), partially purified by aminoethyl-cellulose chromatography, 
was subjected to high-performance liquid chromatography with an 
acetonitrile gradient of 15-60%. Fractions were collected and samples 
were assayed for A-chain radioimmune activity. Solid line, A280; A, 
radioimmune activity. 

used a radioimmunoassay based on the reconstitution of com- 
plete insulin from the separate chains. The insulin reconstitution 
procedure of Katsoyannis et al. (3), adapted to a 27-^1 assay 
volume, provided a very suitable assay. Easily detectable insulin 
radioimmune activity was obtained after S -sulfonated deriv- 
atives of the insulin chains were mixed and reconstituted by the 
procedure described in the legend of Fig. 3. The separate S- 
sulfonated chains of insulin do not react significantly, after 
reduction and oxidation, with the anti-insulin antibody used. 
Our reconstitution assay, though not extremely sensitive (limits 
of detection about 1 ng), was specific and suitable for following 
insulin chain radioimmune activity during purification. 

To use the reconstitution assay, we partially purified the 
i8-galactosidase-A or B chain hybrid protein, cleaved it with 
cyanogen bromide, formed S-sulfonated derivatives, and 
partially purified the peptides as described in Materials and 
Methods. This procedure was based on our earlier experience 
with the purification of somatostatin from E. coli (unpublished 
data) and the known properties of the insulin chains. 
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Table 2. Reconstitution of radioimmune human insulin 



"A" S3mpl6 


*'B** S£iinpl6 


Radioimmune active 
insulin 


E. coli 58-HPLC* 




<0.5 




E. coli DE117t 


<0.5 


Porcine A' 


E. coli DE117 


74 


E. coli 58-HPLC 


Bovine 


45 


E. coli 58-HPLC 


E. coli DE117 


20 



Our standard reconstitution assay procedure was used (Fig. 3). The 
results are given as ng of radioimmune active insulin produced per 
20 fil of the reaction mixture. HPLC, high-performance liquid chro- 
matography. 

* Five hundred microliters of fraction 58 from an aminoethyl-cellulose 
column was chromatographed on an RP-8 column and the "A" peak 
was collected. As estimated from the peak height, the sample con- 
tained approximately 25 iig of protein. 

t Ten microliters of DEAE-cellulose fraction 117 (Fig. 4), concentrated 
to 1.6 mg of total protein per ml, was used as the "B" sample. 

' S-Sulfonated porcine A (70 iig). 

8 S-Sulfonated bovine B (10 Mg). 

Insulin B-chain radioimmune activity was detected first 
among the S -sulfonated cyanogen bromide peptides insoluble 
at pH 5 (fraction F-10, Fig. 3). The activity was enriched fur- 
ther by chromatography on DEAE-cellulose (Fig. 4). The B- 
chain radioimmune activity coeluted with S-C^^SJsulfonated 
bovine B chain. 

A portion of the material purified by DEAE-cellulose chro- 
matography was analyzed by high-performance liquid chro- 
matography on a reversed-phase RP-8 column (Fig. 5 A). This 
column separates primarily on the basis of hydrophobic inter- 
actions. The insulin B-chain radioimmune activity eluted at a 
position very close to that of bovine B chain. Good purification 
was obtained by high-performance liquid chromatography, but 
the breadth of the peak indicated that the chromatographic 
fraction was not pure. 

Another sample (1 mg total protein) of the material purified 
by DEAE-cellulose chromatography was subjected to gel fil- 
tration on Sephadex G-75 in 50% acetic acid, a system that 
completely resolves A chain from B chain. The B-chain ra- 
dioimmune activity eluted at the same position as S -sulfonated 
bovine B chain, indicating similar sizes (data not shown). 

Insulin A-chain radioimmune activity was detected first in 
the total mixture of cyanogen bromide peptide fragments ob- 
tained from the partially purified /3-galactosidase-A chain 
hybrid. The activity was enriched by pH 5 precipitation and 
aminoethyl-cellulose chromatography and purified on a mi- 
crogram scale by high-performance liquid chromatography 
(Fig. 5B). The insulin A-chain radioimmune activity eluted 
from the column at a position indistinguishable from that of 
porcine S-sulfonated A chain. Porcine A chain is identical to 
human A chain (2). 

When an excess of porcine A(SS03~) (40 /ig) was mixed, re- 
duced, and oxidized with bovine B(SS03~) (10 Mg), we usually 
obtained 10-15% correct joining to yield radioimmune active 
insulin. Reconstitution in impure mixtures was lower, as ex- 
pected. Because of this strong and variable competitive inhi- 
bition by other peptides, the amount of insulin chains in the 
extracts can best be estimated by adding to the extract a known 
amount of the chain to be assayed. This type of experiment 
(illustrated in Fig. 3) indicates that the yield of insulin chains 
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is high (approximately 10 mg from 24 g wet weight of cells) and 
consistent with the amount of insoluble j3-galactosidase protein 
obtained (at least 10^ molecules per cell). This estimated yield 
is 10 times higher than that reported for somatostatin (1). 

The evidence that we have obtained correct expression from 
chemically synthesized genes for human insulin can be sum- 
marized as follows, (i) Radioimmune activity has been detected 
for both chains, (ii) The DNA sequences obtained after cloning 
and plasmid construction have been directly verified to be 
correct as designed. Because radioimmune activity is obtained, 
translation must be in phase. Therefore, the genetic code dic- 
tates that peptides with the sequences of human insulin are 
being produced. (Hi) The E. coli products, after cyanogen 
bromide cleavage, behave as insulin chains in three different 
chromatographic systems that separate on different principles 
(gel filtration, ion exchange, and reversed-phase high-perfor- 
mance liquid chromatography). (it>) The A chain produced by 
E. coli has been purified on a small scale by high-performance 
liquid chromatography and has the correct amino acid com- 
position (Table 1 ). 

Table 2 illustrates that insulin radioimmune activity can be 
obtained entirely from E. coli products. Easily detectable ra- 
dioimmune insulin activity is produced when purified E. coli 
A chain is mixed and reconstituted with partially purified 
(«10% pure) E. coli B chain. 
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SUBIMARY 

This report describes the construction and isolation of a plasmid, derived 
from pBR322, which carries a Bglll restriction fragn'-nt of DNA containing 
the (^lactokxnase gene from Saccharomyces cerevtsme. This was accomplished 
by the following procedure: (1) Purified galactokinase mRNA, labelled with 

was hybridized to fl^fll digests of yeast DNA employing Southern's 
filter ttansfer technique to identity a restriction frs^ent containing the 
^Oactokinase gene. (2) This fragment was partially purified by agarose gel 
electrophoresis, ligated into the BamHl site of pBR322 and transformed into 
EacherUshia coli to generate a clone bank containing the galactokinase gene. 
(3) This bank was screened by in situ colony hybridization with galactokinase 
mRNA resulting in the identification of a plasmid carrying this gene. This plas- 
mid DNA hybridized wilh the galactokinase mRNA to the same extent in the 
presence or absence of a large excess of unlabelled mRNA from cells that were 
not induced for galactokinase synthesis, while l^e same amount of unlabelled 
galactose-ihduced mRNA reduced the hybridization by 95%. When this phismid 
was introduced into an E. eoli strain deleted for tiie galactose operon it caused 
the syntiiesis of low levds of yeast galactokmase activity. 



INTRODUCTION 

We have previously reported the purification of the gal 1 gene product 
(galactokinase) from S, ceremiae (Schell and Wilson, 1977). A specific anti- 
body wis i)i»pared against IJiis protein and subsequenliy used to purify the 

Ab^^Hinu: BSA, bovine semm albumin; DTT, dithiothreitol; SDS, sodium dodecyl 
sulphtAe, 
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mRNA coding foft galactoldiiase by iinmunopzecq>itation (Scbeill and Wilson, 
1978). As the next step the mRNA was used as a probe to detect a dumeric 
plasn^ contain]]^ tiie yeart gaJactoMnase ^ne. The successful iscdation of 
this plasmid would demonstxate Ihe utility of this type of sequential isolation 
(protein antibody mRNA -* DN A) to obtain probes to study and done 
many other eucaiyotic genes, whose mRNA's represent less than 2% of the 
total cdlular mRNA. 

There are two possible approadies to the cloning of a spedfic gene ficapnent 
utilizing a putified mRNA as a probe for the gene sequence. The first sq;>proadi, 
the "^ot gan** method, invcdves the screening of a done bank containu^ the 
entire yeast genome for a clone that would hyt»idize to ^lactokinase mRNA. 
However this abroach is complicated by the la^ number of rDN A dones 
that are presemt in a bank of the entire yeast gencnne as th^ wiU hybridize 
to the small amounts of rRNA viiiich contaminate most mRNA preparations. 

Alternatively, the fragment containii^ the desired gene can be identified 
by the use of the method of Southern (1973) and partially purified before the 
generation cf the done bank. This method has several advantages. First the 
partially purified done bank is much onaller than the **diot gun" bank and 
therefore is mudi easier to genevate and screen. Second, hylnidization con- 
ditions can be established for the identification of the specific gene fitagment. 
Finally, ribosomal DNA can be diminated team the done bank by the dioice 
of the proper restriction enzyme to generate the fitagment 

METHODS AND PROCEDURES 

Preparation of ***I-labeUed gaiaetokinase mRNA 

Galactokinase mRNA was purified by immunopredpitation of total yeast 
polysomes with galactokinase antibody, deproteinization of the RNA in the 
immunopredpitated polysomes, and chromatography of this RNA on poly- 
uridylic acid lint^^harose l:^ the procedure d^oibed by Scfaell and 
VITilson (1979). In vnro ^^-labelled mRNA prepared by this method did not 
have a h^ enot^ specific activity to be used to probe for the galactokinase 
gene in restriction enzyme digests of the entire yeast genome. Therefore, 
the galactokinase mRNA was purified after low level ^'P labelUng (in vivo; 
for monitoring purification) and iodinated with '^'I by a modification of the 
method described by Frensky (1976). The reaction was prepared by the 
addition of the following reagents: 3.0 n\ Na "n (0.4 Ci/ml in 50 mM NaOH; 
carrier ficee); 2.0 txl 0.15 N HNO3 in 0.2 M NaAc (sodium acetate) pH 4.8; 
1.5 Ml 0.2 mM KI in 0.1 M NaAc pH 4.8; 5.0 id RNA (0.2-0.6 ^g) in 0.1 M 
NaAc pH 4.8; 2.0 fil 0.1 M NaAc pH 4.8; 5.0 ul 1.4 mM TlCla in 0.1 M NaAc 
pH 4.8. 

After the T1(S3 catafyst solution was added, the solution was incubated 
at eO^C fox 25 min; 0.2 ml of TNE (0,1 M Tns-HCl pH 8.0, 0.1 M NaQ, 
1 mM EDTA) was added to the reaction and the incubation was ccmtinued 
for an additional 12 mm. Then 0.05 mg of E. coU rRNA was added to the 
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solution and the iodinated mRNA was pvttified away from the unieacted 

by chromatography on 0.5 ml columns of CF-11 cellulose as described 
by Prensky (1976). The resultant RNA sample contained 3* 10' cpm giving it 
a specific activify of 0.5— 1.0' 10* cpm/Mg RNA. 

Restriction enzyme analysis of yeast DNA for the galactokinase gene 

Yeast DNA was prepared by the method described by Cryer et al. (1975) 
and furthOT purified by equilibrium sedimentation centdfugation in CsCl. The 
DNA was digested by placing it in 10 mM Tris'HCl pH 7.5, 10 mM MgCh, 
6 mM DTT, 50 mM NaCl, 0.1 mg/ml BSA to a final DNA concentration of 
0,05 to 0.3 n^/ml. Resbfiction enzyme was added to the solution at a ratio 
of 1 unit of enzyme to 1 ^ig of DNA and it was incubated at 37°C for 2—5 h. 
These samples (5-50 nl) were loaded onto 1% agarose slab gels (0.3 cm X 20 cm 
X 20 cm) containing 50 mM Tris-Ac pH 8.2, 20 mM NaAc, 20 mM NaCl, 
3 mM EDTA. The gels were run at 50 mA for 10 h while dxculating the 
electrophoresis buffer between the upper and lower tanks witii a peristaltic 
pump (Wu eta!., 1976). 

the lestxiction fragments separated on these gels were transferred onto 
nitrocellulose sheets by a modification of the method of Southern (1975). 
Hybridization of ^'^I-labelled galactokinase mRNA to these sheets was per- 
formed in "Seal-A-Meal" bags in a minimal volume of 5 X SSC + 0.5% SDS 
at 65X for 20-30 h (SSC = 0.15 M NaCl, 0.015 M Na-citrate, pH 7.0). Usual- 
ly 3 ml of this hybridization solution containing 0.3 mg otE. coli tRNA and 
0.5—2.0* IQ^ cpm of "^Mabelled galactokinase mRNA was used for a 10 X 13 
cm sheet. After hybridization the nitrocellulose sheet was removed from the 
bag, washed 3 times with 200 ml of 2 X SSC + 0.5% SDS at 43''C for 30 min 
each time, and then finally washed with 200 ml of 2 X SSC for 30 min at 43**C 
three more times. The sheet was air-dried on a glass plate, coated with a solu- 
tion of 20% PFO in toluene and dried again. The sheets were autoradiographed 
at -20**C using Kodak XR-5 X-ray film with a Picker intensifying screen for 
2— 14 days. 

Preparation ofpkumid DNA 

E. coli strains containing the plasmid pBR322 (Rodriguez et al., 1977) were 
grown at 37''C in M9 media supplemented with 0.2% glucose, 0.3% casamino 
acids, 0.1 mg/ml ampicillin, and any specific stxain requirements to late log 
phase (jisoonm = 0.9). Chloramphenicol was added to the culture at a final 
concentration of 0.17 mg/ml and the culture was shaken at 37°C for 12—16 h 
to amplify the plasmid. Plasmid DNA was prepared from these cells by a modi- 
fication of the method of Tanaka and Weisblum (1975). The amount of DNA 
was determined by reading the Az^o^m and by the diphenylamine assay (Burton, 
1968). 

Extraction of DNA from agarose gels 

DNA was ^tiacted from agarose gel sections by the following method. Gel 
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sections (1.5 ml) woe dowly frozen and lhawed 3 lames and homogenized in 
a siliconized douncehomogenizer. 3 ml of TNE (20 mMTris-HapH 7.5, 
0.4 M NaCl, 5 mM EDTA) were added and ihe suspension lehomogenized. 3 ml 
of TNE-satuiated phenol were added, the suspension was rehomogenized, 
aUowed to sit at room tempeiaiture for 30 min, and centrifuged at 9000 rev./ 
min in an HB-4 rotor. After removal of the aqueous phase the phenol and 
agarose were extraeted 2 more times as described above with 3 ml TNE. The 
combined aqueous phases were extracted with TNE^turated phenol and 
ethanol-predpitated. Approx. 50% of the DN A m the gel sections was recover- 
ed by this metihod. 

Ligation and tnnsfomuttion 

Ligation of restriction enzyme fragments into pBR322 plasmid DNA 
was performed by a modification of the method of Tanaka and Weisblum 
(1975). The conditions are described as follows. Flasmid pBR322 cut with 
Bamm restriction enzyme was diluted to 22 /xg/nd hi the foUowing solution: 
30 mM Tris-Ha pH 7.5, 0.05 mM ATP. 12 mM MgClj, 50 mM NaCl, 10 mM 
urr, JSg/II restriction enzyme fragments isolated from agarose gels were added 
at a final concentration of 80 itilvci; 0.3 units of T4 Ugase were added, and the 
reaction mcubated for 12 h at 12-C. The ligation of the yeast DNA fragments 
from the gel into the plasmid was monitored by agarose gel electrophoresis in 
a 1% agarose gel contammg 90 mM Tris* borate pH 8.3, 3 mM EDTA (Greene 
etal.,1975). 

Competent cells of E, eoli strain M94 (CGSH/5346) were prepared as des- 
cribed by Morrison (1977). Transformation of these cells with plasmid DNA 
was performed by a modification of the method of Tanaka and Weisblum (1975), 
The ligation reaction mix (0.05 ml) was added to 0.06 ml of 10 mM Tris*HCl 
pH 7.5, 1 mM EDTA. 20 mM NaO, and 100 mM CaCla and 0.2 ml of com- 
petent cells were added to this solution. After incubation at 4*^0 for 30 min 
the cells and DNA solution were incubated at 42**C for 3 min with shaking. 
Then 2.7 ml of L-bioth + 0.2% ^ucose + 0.1 mg/ml ampicillin were added to 
the cell suspension and incubated at 37^C with moderate shaking for 40 min. 
Aliquots (0.1 ml = 150 tiansfonnants) were spread on L-broth plates con- 
taining 0.1 mg/ml ampicillin and 0.2% ^ucose, and the plates incubated at 
37*Cfor20h. 

The chilled pktes were replica plated (Miller, 1972), onto L-broth -i- 0.2% 
glucose plates containing 0.02 mg/ml tetracycline, and these plates incubated 
for 15 h at 37^C. The original master plate (ampicUUn) was superimposed onto 
the tetracycline plate and the colonies that did not grow on the tetracycline 
plates were picked onto new plates (L-broth + 0.2% glucose + 0.1 mg/ml 
ampicillin). 

Screening of plasmid conttttnu^stnUns for the galactokinase gene 

The strams whidi wem found to contaon plasmid pBR322 witii inserted 
yeiffit DNA (ampiciUih reisstaht/tetracycline sensitive) were grown at ZTC 
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for 15 h on L-broth plates containing 0,1 mg/ml ampicillin. Filters which con- 
tained pU^id DNA from each individual strain at the site of each colony 
were prepared on Whatman 540 paper by a modification of the method of 
Beckmann et al. (1977). To detect the clones containing the galactokinase 
gene, the filters were hybridized with '^^I-labelled galactokinase mRNA. 
20 filters (6X8 cm, containii^ 48 colonies each) were placed in a "Seal-a- 
Meal" bag and then 15 ml of 50% fozmamide, 4 X SSC, 0.5% SDS, 0.2- 10^ 
cpm/ml galactokinase mRNA were added to the bag. The hag was sealed and 
incubated at 41*'C for 24 h. 

After hybridization the filters were washed twice with 50% formamide in 
4 X SSC for 20 min with 200 ml each time at room temperature, and three to 
four times with 2 X SSC, again for 20 min each. The filters were dried and 
autoradiographed for 1 week. 

Hybridization analysis of putative galactokinase containing plasmids 

To prepare plasmid DNA fixed to nitrocellulose filters (Cooper et al., 1974), 
50 ng of plasmid DNA was added to 0.5 ml of 0.01 X SSC containing 20 ugl 
ml E. coli DNA. The DNA was denatured by heating at 98°C for 12 min and 
quickly chilled to O^C. The solution was adjusted to 6 X SSC in a final volume 
of 2.0 ml, passed slowly throt^ 25 mm nitrocellulose filters; the filters were 
washed and then baked. From each filter 5 mm diameter circles were punched 
out with a standard pf^er punch; each of these filters contained 3-4 ng of 
plasmid DNA. 

Competition hybndizatiott of the "'^I-labelled galactokinase mRNA to these 
immobilized plasmid DNAs was performed by placing each disc in 0.05 ml of 
hybridization solution (50% formamide, 0.8 M NaCl, 0.5% SDS, 50 mM 
Tris*HCl pH 7.5, 10 mM EDTA, 260 /Ltg/ml induced or non-induced polysomal 
mRNA) in conical microfuge tubes. After pre-hybridization for 40 h at 41'^C, 
Ip ng of '^'I-labelled galactokinase mRNA (0.2* 10' cpm) in 0.01 ml of 
hybridization solution were added to each hybridization tube. After an addition- 
al incubation at 4VC for 40 h, the nitrocellulose filters were assayed for hybrid 
formation by a modification of t^e method of Cooper et al. (1974). 

Assay of gt^tokinase activity 

Galactokinase assays were performed by a modification of the method of 
Schell and Wilson (1977) using [*^C] galactose with a specific activity of 
10 000 cpm/nmole. 10 mM glucose was included in some assays to block 
phosphorylation of galactose by exizymes other than yeast galactokinase. 
Protein was measured by the method of Lowry et al. (1951) using bovine 
serum albumin for reference. 

Preparation of mRNA 

Total polysomal mRNA tcom galactose-induced or uninduced yeast was 
prepared from & eerevisiae strain X108D as described by Schell and Wilson 
(1979). Total induced or uninduced mRNA was pr^and by deproteinization 
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^Kjedium was pi«paied by adding ttie following diemicals to 1 .0 Wer 
of distiHed water: 6 g Na,HP04, 3 g KH^PO*, 0.5 g NaCl, 1 g JK^X After^ 
aatodaving the medium was made to 1 mM MgSO* and 0.1 mM CaO^ by fee 
addition of appropriate amounts of sterile 1.0 M stock solutions. L-broth was 
prepared by adding 10gofbacto-tiyptone,5gNaa.and5gyeaste3tttact 

to 1.0 Utet of water; plates contained 2.0% agar. Carbon sources were mcluded 
at a final concentration of 0.2%. 



Oiemicals . ^, , • *_ v 

•ncia (K and K Laboratories); Na»«I (New England Nuclear; earner free), 
KI (Mallinckrodt); Agarose (Seakem); Nitrocettulose sheets (Millipore); CsCl 
(Kawecki-Berylco); phenol (Mallinckrodt; distilled and stored at -20 C); 
formamide (Fisher; distilled in vacuo and stored at -20'<;); DNA enzymes: 
Bamm, BgM, £coRI, T4 ligase (New Bn^and Biolabs); ampfciUin (Parke- 
Davis; AmsilrS); tetracycUne (Sigma). AH otiier chemiwls were purchased 
from Sigma Chemical Co. or were of reagent grade purity. 

Recombinant DNA procedures 

These experiments were carried out in strict compUanoe with the NIH 
Guidelines for research involving recombniant DNA molecules. The level of 
containment employed was P2 EKl as prescribed by these guidelines. 



RESULTS 

Identification of a restriction fragment containing the gaiactokinase gene 

Yeast DNA was digested witii ftg/U restiiction enzyme, subjected to agarose 
gel electrophoresis, and tiie resulting pattern of DNA fragments transferred 
to nitrocellulose filters as described in METHODa These DNA filters were 
hybridized with "^Mabelled galactokinase mRNA and autoradiographed. 
Three discrete bands (A,B,C) were consistentiy detected in the Bglil restric- 
tion enzyme digest. This is illustrated in Fig. 1 (dot 2) which is a typical 
autoradiograph of a nitrocellulose sheet containing a £Sf ni digest hybridized 
with galactokinase mRNA. Due to the larger amount of DNA coding for 
rRN A relative to sin^e copy genes it was likely that at least one of these 
bands resulted from the hybridization of small amounts of rRN A which con- 
taminates most RNA preparations. This conclusion was supported by 
preliminary Southern filter hybridization experiments with £coRI digests of 
yeast DNA, where the galactokinase mRNA preparation hybridized to known 
rDNA restriction enzyme fragments (BeU et al., 1977). Furthermore fragment 
C in the hybridization pattern in Fig. 1 has a molecular weight which is 
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Fig. 1. Hybridization of "'Habdled gatactokinase mRNA to BgUl restriction enzyme digests 
of yeast DNA. Yeast DNA was digested with restriction enzyme, fractionated by 
agarose gel dectrophoreste and transferred to nitrocellulose sheets. The DNA digests were 
hybridized with: Slot 1 « 10 /ig of BgtU digested DNA hybridized wiUi 10* cpm of 
labelled total induced polysomal mRNA. Stot 2 " 10 Mg of BgtU. digested DNA hybridized 
with 10* cpm of '"MabeUed gdactokinase mRNA. Slot 3 » 10 ^ig of BglU digested DNA 
hybridized with 10* cpm of '"MabeUed yeast ribosomal RNA. Molecular weight values 
were calculated usii^ £eoRI digested \'phage DNA that was run on the same gel as a reference. 

identical to tiie size of the £^211 restriction enzyme fragments containing 
the yeast rDNA (J. Szostak, personal communication). 

This hypotiiesfe was confirmed by hybridizing B^ll digests of yeast DNA 
on nitrocelltilose sheets with "^X-Iabelled yeast rRNA or total mRNA from 
non-induced yeast ceUs. The results of these hybridizations are illustrated 
in Fig. 1 (slots 1 and 3). It is clear that fira^ents A and C are probably 
rDNA feagments since they are the only bands in Bglll digests that hybridize 
yeast rRNA and non-induced mRNA. Fragment B did not hybridize to either 
rRNA or non-induced mRNA and therefore it was likely that fragment B with 
a molecular weight of 3.7 • 10^ was the BgHI restriction fragment containing 
the galactokinase gene. 

PmikU puiificaUon of the jutoctofe those gene fi^agment 

In Older to isolate preparative amounts of this DNA fragment, 400 ng of 
yeast DNA were digested with 30 units of f^fll restriction enzyme for 22 h 
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at arc in the solutioii described in METHODS. After etiianol precipitation 
the redissobed DNA was run on a preparative 1% agarose gel (4 slots; 0.6 cm X 
4 cm) at 70 mA for 18 h; \ DNA cut with EcoBl was run on the same gel as 
a molecular we^t marker. 

Since Ihe molecular wei^t of the putative galactotdnase gene contdmmg 
i^m restriction hagmtoA was 3.7' 10*, the area of the gel containing ihe 
restriction enzyme feagments of molecular wei^t 3.4' 10* to 4.0- 10* was cut 
from the gel and Ihe DNA estracted from the agarose pieces as described in 
MFFHODS. The final yield of DNA from ihis partial purification procedure 
was 20-30 Mg. Since 50% of the DNA was lost during manipulation, it can 
be estimated that this partially purified DNA fraction containing ihe galacto- 
kinase gene fragment had been enriched 10-20-fold for this q>ecific fragment. 

Generation of a clone bank 

The DNA isolated from the gel was ligated mto pBR322 plasmid DNA at 
the AimHI restriction enzyme site as described in METHODS. The ligation 
appeared to be greater than 50% complete as estimated by electrophoresis 
using a 1% agarose gel in Tris* borate buffer (Greene et al., 1977). The mixture 
of l^ated plaanid and plasmid with inserted yeast DNA fragments was used to 
transform E, coU strain M94 by the procedure described in METHODS. Selection 
for transformed cells containing pBR322 on plates containing 0.1 mg/ml 
ampicillin showed that 1.5' 10^ ceUs/ml were ampicillin-resistant out of a total 
of 1.7' 10* viable ceUs/ml indicating a frequency of 10"* transformation to 
ampiciUin resistance per cell with the plasmid preparation. The entire trans- 
formation mixture was plated out on 25 ampicillin plates (0.1 ml = 150 
colonies/plate). When these plates were replica plated on tetracycline plates 
approx. 15% of the total ampicillin-resistant transformants were found to be 
tetracycline-sensitive. Since the yeast DNA fragments are inserted into the 
tetracycline-resistance gene on the plasmid (Rodriguez et al., 1977), these 
15% were presumed to contain plasmid with yeast DNA ligated into it. From 
three individual transformations 1500 ampicillm-resistant/tetracydine- 
sensitive transformants were Uolated. It was calculated from the purification 
factor of the galactokinase gene fragment by agarose gel electrophoresis, that 
there should be from 300—800 unique DNA fragments in Ihis done bank and 
therefore it should contain from 2—5 plasmid clones with the BgJll restriction 
enzyme fragment containing the galactokinise gene. 

Detection of pkamid contmnktg the gdactokinase gene 

Filters were prepared with DNA from each of the plasmid strains fixed at 
discrete locations on Whatman 540 psqper as described in METHODS. After 
hybridization of these filters and autoradiography, onfy 6 colonies out of the 
1500 transformants hybridized to "^I-labelled galactokinase mRNA. Each 
of these strains were grown and plasmid DNA isolated as described by 
Mealier et al. (1977). After digestion of these plasmids with EcoKl and gel 
eleclxophoresiB in 1% agarose none of the six plasmids s^peared to have 
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identical band patterns. FurChennoTe, two of Hie six unique plasmids appeared 
to contain a piece of yeast DNA that was too small to be the 3.7' 10^ BglTl 
£ra^ent. The oi^ of tiiese small plasmids is not known although ceUular 
modification of the plasmid k a possible explanation. However four of the 
plasmids contained yeast DNA fin^ments whic^ mi^t contain the galactokinase 
gene. 

Since it has hem shown tiiat mRNA isolated from non-induced polysomes 
contained no galactokinase mRNA (Schell and Wilson, 1979), tiie plasmid 
containing the yeast galactokinase gene could be identified by competition 
hybridization. Competition hybridizations of the '^^Mabdled galactokinase 
mRNA to the various plasmid DNAs fixed on nitrocellulose filt^ were per- 
formed after pie-hybridization witii an excess of unlabelled mRNA firom botii 
induced and non-induced yeast cells as described in METHODS. The results are 
presented m Table I. As expected tiie hybridization of the ^^^I galactokinase 
mRNA to the various plasmid DNAs pre^iybridized witii an excess (12 ptg 
mRNA/2 plasmid) of unlabelled induced mRNA is completely blocked. 
Hybridization of two of the plasmid DNAs (GB-4 and GB-7) was not blocked 
by an excess of non-induced mRKA while the rest of the plasmids were totally 
blocked. We conclude firom tiiis experiment that the plasmids designated 
GB-4 and GB-7 contain a sequence homologous to a galactose induced mRNA 
since tiie hybridization of galactokinase mRNA to tiiem cannot be blocked 
by unlabelled non-mduced (Le. non galactokmase-containing) mRNA. Further- 
more, when these plasmids (labelled by nick translation [Rigby et al., 1977] ) 
were hybridized to ^/n digests of whole yeast DNA on nitrocellulose filters 
both plasmids hybridized to tiie 3.7 • 10^ dalton restriction fragment that 



TABLE I 

HYBRIDIZATION OF GALACTOKINASE mRNA TO DIFFERENT PLASMID DNAs 
PRE-HYBRIDIZED WITH VARIOUS mRNA FRACTIONS 

Galactokimue mRNA (10 ng ■> 0.5 • 10* cpm "'I) was hybridized to nitrocellulose filters 
containing 3 ng of the varitMis plannid DNAs as described in METHODS. The filters were 
first pi»4iybridized with no unlabeled mRNA, 0.012 mg of unlabelled polysomal mRNA 
from non*induced yeast cells (+ NX mRNA), or 0.012 mg of polysomal mRNA firom 
galactose induced yeast cdls (+ 1 mRNA)L Values are expressed both as the >'*I cpm of 
galactokinase mRNA hybridized (nr as the percentage of hybridization in the absence of 
pre-hylvidization with unlabelled mRNA. 

"'I cpm hybridized 



Nopre-hybridizatton +NImRNA(%) +ImRNA(9&) 



GS-2 830 10 (1) 20(2) 

GB-3 2120 23 (2) 15(1) 

GB-4 860 840 <98) 25(3) 

GB-5 6100 50 (1) 60(1) 

GB-6 750 27 (4) 20(3) 

OB-7 350 325(93) 15(5) 
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hybridized onty galactokinase mRNA shown in Fig. 1. The plasmid GB-7 also 
contained an additional ftagment of sli^Uy smaller mcdecular weight; Urn 
fragment was "co-cloned^' into a plasnud v/iOi the gaiactokinase fragment to 
produce the larger plasmid. This is supported by the fact that GB-4 DNA was 
resistant to BglH digestion, while GB-7 was cleaved by J^glll to produce a 
luiear DNA molecule. Thus GB-4 and GB-7 are the same plasmid except that 
GB-7 contains an additional :£(|Fin &agm^^^ insertion 
of another inctividual BN A fin^eht into the plasmid molecule. 

Expression of yeast gi^tokirwae gene in E.ca^ 

proof that the doned fragment, which is homologous to a galactose-induced 
mRNA, does contain the functional galactokinase gene was provided by the 
following experiment. The GB-4 plasmid DNA was used to transfonn E, coU 
strain S165 which contains a deletion of the E. coli galactose operon (Starlinger, 
pers. commun.). \Vhen ^tracts of this strain containu^ the GB4 plannid were 
analysed for gsQactotdnase activity, a low but significant activity (Table II) was 
detected. Extracts of this strain containing the GB-4 plasmid exhibited a 
galactokinase activity of 2' 10*^ U,/mg, while extracts of the same strain con- 
tainii^ no plasmid or the pBR322 plasmid have an {^parent activity of !• 10~^ 
\J.[mg. When the same assays were performed in the presence of 10 mM ^ucose 
to block i^osfdiorylation of galactose not catalysed by yeast galactokinase 
(ScheU and Wilson, 1977), the GB-4 plasmid bearing strain still had an activity 
of !• 10~^ U./mg while the control strains (with pBR322 only or with no 
plasmid) have galactokinase activity which is at the limit of detection of the 
gaiactokiiuise sssasf procedure. This experiment ^ows that the cloned fragment 
contains a fonctional yeast galactokinsee gene, which can be expressed at a low 
level in £. coli. 

TABLE n 

EXPRESSION OF GALACTOKINASE ACTIVITY IN VARIOUS PLASMII>CONTAINING 
STRAINS 

Cdls of £ coli strain S16S containing the dedgnated piasmids w«re grown in 10 ml of 
L-broth containii^ 0.2% galac^xe and 0.1 rag/m! ampjciUin to A, mm * O-^' 
were harvested, imulied, and cdl extracts prepared by sonication. Extracts were then 
assayed for galactokinase activity in the preseiiM and abseiuse of 10 mM ^ucose in the 
assay mix. Activi^ levels in fiiese strains is expressed in the upits/mg protein • 10'. 



Extract 
source 


Galactokinase acthri^ (U./mg • 10*) 


no^ucose 


■i- 10 mM glucose 


S165 + 0 


1.0 i 10% 


0.05 ± 20% 


S165 + pBR322 


1.0 ± 10% 


0.05 ± 20% 


8165 + GB^ 


2.01 10% 


1.0 ilO% 
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DISCUSSION 

The isolation of a plasmid containing the yeast galactokinase gene in an 
E. co/i host is the final step in the purification of the components of the ex- 
piession of tlie galactokinase gene. The successful isolation of this plasmid 
was dependent on the sequential puxif ication of galactokinase (Schell and 
Wilson, 1977)r galactokinase antibody^ and galactokinase mRNA (Schell and 
Wilson, 1979) since each of tiiese purified components was necessary to 
produce an assay system for the galactokinase gene. This successful cloning 
expoim^t provides support for the general use of liiis sequential isolation 
metliod (protein -* antibody mRNA DNA) to clone and study many 
oti^ eukaryotic genes and their components. Although the methods used 
to isolate the galactokinase gene are not new, the cloning of a specific gene 
whose mRNA represents less than 2% of the total cellular mRNA utilizing 
these methods has not been reported. 

The isolation of the galactokinase plasmid was dependent on the use of 
very h^ specific activity *^^X-labelled purified galactokinase mRNA to 
q)ecifically hybridize to the galactokinase gene fragment in BglU digests of 
the entire yeast genome, allowing subsequent partial purification of the frag- 
ment 10- to 20-fold. Althoi^ other restriction en:^mes were invest^ated, 
the enzyme SgUl was found to produce the largest DNA fir^ment (3.7* 10^} 
containing the galactokinase gene which was easily separated from ribosomal 
DNA. Due to its large size there was a possibility that this DNA firagment 
would also contain other parts of the galactose gene cluster in addition to 
the galactokinase gene. 

The paridaliy purified DNA preparation was l^ated in the BamHl site of 
the plasnud pBR322 inserting the yeast DNA into the plasmid gene for tetra- 
cycline resistance. This allowed elimination of the large number of trans- 
formants whidi contain re-ligated plasmid without inserted yeast DNA (85%). 
In situ colony hybridization to 1500 of these clones with '^H-labelled galacto- 
kinase mRNA detected six individual plasmids that stron^y hybridized the 
galactokinase mRNA preparation. However, for four of the six plasmids this 
hybridization could be completely blocked by an excess of unlabelled mRNA 
firom cells that were not induced for the t^nVtiesis of galactokinase. Two 
different plasmids (GB-4 and GB-7) hybridized galactokinase mRNA in the 
presence of the same amoimt of non*induced mRNA that completely blocked 
the hybridization by the other plasmids. It can be concluded from these results 
that these two plasmids contain a sequence coding for a galactose inducible 
mRNA. The identity of the other four plasmids that hybridize galactokinase 
mRNA but are blocked an excess of non-induced mRNA is not known. 
Th^ could contain sequences of DNA that are homologous to the non-coding 
portjon of the galactokinase mRNA, or represent DNAs homologous to con- 
taminating mRNAs in the galactokinase mRNA preparation. 

Hybridization experiments with the galactokinase plasmids showed that a 
large excess of total ceUidar mRNA or total polysomal mRNA from unindut^d 
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cells does not block tlie hybridization of galactoldnase mRNA to llie plasmid. 
Tias suggests ftiat there is an undetectable amount of galactokinase mRNA in 
yeast cells that axe not exposed to galactose; 'fheiefdxe it seems thit ihe in- 
duf^on of galai^lanase synthesis (and probably Ihe entire gatactbse gene 
chi^t^ by ^stUi^tbie isi^ mB^A^ 
coding^foi these protc^iT^ ^ 
piedoimmmt mechamsm of gsaUctose ena^e 

theas is miemated. 

Ihe proof that one of the isolated plaanids contains a dindaoi^ yeast 
galactokinase ^ne was pxoinded by the expe^ 

expression of the yeast galactokihase gene in E, coli. An E. cofi' strain, deleted 
for its galact(»e operon, cacryii^ the gs^tokinase ptesmid exhibited a galacto- 
kinase actavity of 0.001 U./mg when assayed wider conditions that are specific 
for yeast galactokmase. GaiactokiiGBse assa^ of the same strain containii^ 
either no plasmid or only pBR322 plasmid showed no apparent galactokinase 
activity under &e same conditions. Althoi^ this level of expression of the 
yeast galactokinase gene in E. coli is low» the activity is real and therefore 
strongly sog^ts that the isolated plasmid does indeed contaui a functional 
yeast galactokiirase gene. Therefore the yeast gatoctokinase gene can be fulded 
to the list of yeast g^es vrtiidi have been doned and shown to be expressed 
in E. call (aarke and Carbon, 1978). 
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Communicated by Boris Magasanik, August 18, 1978 

abstract From a large population of strains of Esche- 
richia coli carrying shear fragments of yeast {Saccharomyces 
cerevisiae) DNA attached by jd vitro recombination to the 
plasmid vector pMB9, two hybrid plasmids were selected that 
relieve the pyrimidine requirement of nonreverting pyrF mu- 
tants of E. coli. An 1100-base-pair DNA fragment common to 
the two complementing plasmids was recloned into another 
plasmid vector, pBR322; these new hybrids retained the ability 
to specify orotiaine-5'-phosphate decarboxylase (orotidine-5'- 

Shosphate carboxy-lyase, EC 4.1.1.23) synthesis in K coli. Evi- 
ence is presented that this common fragment is yeast DNA and 
thus apparently carries the structural information for yeast 
oroUdineS'-phosphate decarboxylase, the product of yeast gene 
ura3. A hybrid plasmid containing the 1100-basfrf>air fragment 
was used to measure levels of putative uraS mRNA from yeast 
cultures labeled with pH]adenine. ura3 mRNA was unstable 
with an apparent half-life of 10.5 min. Under different cir- 
cumstances previously shown to alter the level of orotidine-5'- 
phosphate decarboxylase in yeast, a coordinate variation in 
proportion of labeled RNA complementary to the hybrid plas- 
mid was found. These data support the hypothesis that regula- 
tion of the urai gene in yeast is at the level of transcription. 

The recent development of in vitro recombination methods and 
the associated technology has made it possible to isolate specific 
eukaryotic genes. In the case of yeast {Saccharomyces cerev- 
isiae) it has been found that several genes for enzymes in me- 
tabolism can be expressed when introduced into Escherichia 
coli on either plasmid or phage vehicles (1-3). We report in this 
paper that the yeast gene that specifies orotidine-5'-phosphate 
(OMP) decarboxylase (orotidine-5'-phosphate carboxy-lyase, 
EC 4.1.1.23), the last enzyme in the pathway for the biosyn- 
thesis of pyrimidines, can be expressed in E. coli. The gene's 
complementation activity was retained by a fragment of DNA 
about 1100 base pairs long. 

Prom genetic and physiological studies of the regulation of 
the pyrimidine pathway in yeast, Lacroute (4) had obtained 
evidence that the «ro3 gene (which encodes OMP decarbox- 
ylase) is inducible by earlier intermediates in the pyrimidine 
pathway. For example, he found that mutants in another gene 
(ural; defective in dihydroorotase), which accumulate high 
intracellular levels of dihydroorotic acid, contain 5 times more 
OMP decarboxylase. 

We used a hybrid plasmid containing the 1100-base-pair 
yeast DNA fragment containing the uraS gene to measure levels 
of uraS mRNA in yeast by hybridization. We found that the 
level of «ra3 mRNA varied coordinately with the levels of the 
OMP decarboxylase activity and concluded that the Mra3 gene 
is regulated at the level of transcription. 



The publication costs of this article were defrayed in part by page 
charge payment. This article must therefore be hereby marked "ad- 
vertisement" in accordance with 18 U. S. C. §1734 solely to indicate 
this fact. 



MATERIALS AND METHODS 

Yeast and Bacteria. Pyrimidine auxotrophs of S. cerevisiae 
were all derived from the wild-type prototroph a strain FLIOO 
(4, 5). The construction of the yeast/plasmid hybrids is fully 
described elsewhere (6). All manipulations involving Hve in 
vitro clones were done under P2,EK1 containment as suggested 
by the National Institutes of Health guidelines (7). The bacterial 
strains all were derivatives of E. coli K-12. Strain DB6656 is 
pyrF::Mu trpam lacZam hsdK The pyr~ mutation reverts at less 
than 10~^ and was confirmed by enzyme assay. The plasmids 
were pMB9 {tet^) and pBR322 (tet^ dmp«) (8). 

Screening of DNA Pools for Complementation. In order 
to make DNA pools, the hybrid yeast/pMB9 clones were in- 
oculated onto LB (9) agar plates containing 15 ng of tetracycline 
per ml and incubated overnight at 37° C. The cells were washed 
off these plates into erienmeyer flasks (about 250 clones per 
flask) containing LB broth with tetracycline. The total growth 
of the pooled clones together before DNA extraction was only 
about 2 doublings, ensuring that each clone that grew up into 
a spot on the plate was well represented in the extracted plasmid 
DNA. The DNA pools were used for transformation of strain 
DB6656 (6, 10), selecting for pyrimidine independence. 

Analysis of Complementing Hybrid Plasmids. Isolation of 
plasmid DNA and retransformations were done exactly as de- 
scribed (6). Analysis of plasmid DNA with site-specific re- 
striction endonucleases was carried out as described by Sharp 
et al. (11). The hybridization of "nick-translated" radioactive 
plasmid DNA (12) to digested yeast DNA by the method of 
Southern (13) was carried out essentially as described (14, 15). 
The slots containing plasmid DNA were loaded with about 1 
lig of DNA whereas the yeast slots contained 5 ng each; this 
accounts for the immense imbalance in degree of hybridiza- 
tion. 

The cloning of Hindlll fragment b into pBR322 was ac- 
complished by standard methods (16, 17). The ligated DNA was 
used to transform strain DB6656, selecting for growth on 
minimal plates lacking uracil. Transformants were purified and 
tested for resistance to ampicillin and tetracycline. Some of the 
omp'* Pyr"*" strains were more resistant to tetracycline than 
others, but all failed to grow in the presence of tetracycline (15 
/ug/ml), consistent with the idea that the b fragment had been 
inserted into the HindlW site of pBR322 (8). 

Determination of OMP Decarboxylase Activity. The 
method of Wolcott and Ross (18) was used, with minor modi- 
fications. Extracts of yeast and bacteria were made by sonica- 
tion in the cold. Specific activity was determined relative to the 
protein concentrations (19). 

Labeling and Extraction of Yeast RNA. Labeling of RNA 



Abbreviation: OMP, orotidine 5'-phosphate. 
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was done with C^HJadenine (20 /tCi/ml; 25 Ci/mmol; Saclay), 
which was added to yeast cells growing exponentially in excess 
uracil. Samples were taken and the cells were either arrested 
by the addition of 2 vol of cold ethanol or chased with nonra- 
dioactive adenine (50 Mg/ml) and arrested later. During the 
chase, total acid-insoluble radioactivity increased no more than 
40% in the first few minutes and remained constant thereafter. 
Equilibrium labehng was done by incubation for at least three 
generations in medium containing 20 ^Ci of [^Hjadenine per 
ml in a total adenine concentration of 10 tig/m\; these condi- 
tions result in only 20% incorporation of the radioactivity. 

RNA was extracted from cells by a minor modification of 
methods described previously (20); no carrier RNA was added 
and sodium dodecyl sulfate was omitted from the second eth- 
anol precipitation. RNA samples were lyophilized after the final 
ethanol precipitation and taken up in hybridization buffer. 

Hybridization. Nitrocellulose filters with plasmid DNA were 
prepared by filtration under denaturing conditions (21, 22). 
Sartorius filters (type 11306, 0.45 ^ni, 25 mm diameter) were 
loaded with plasmid DNA, and 5-mm-diameter microfilters 
containing 1-2 fig of DNA were punched out of them. Hy- 
bridization was done under conditions slightly modified from 
those of Kourilsky et al. (23). The hybridization solution was 
0.3 M NaCl/0.03 M Na citrate/40% (vol/vol) formamide 
containing 0.02% bovine serum albumin, 0.02% polyvinyl- 
pyrrolidone, 0.02% Ficoll, and 0.2% sodium dodecyl sulfate to 
decrease the background (24). Radioactive RNA was dissolved 
in this solution (usually 2 ml) and 0.2-ml aliquots were incu- 
bated for 2 days with the DNA filters at 37° C in a sealed glass 
vial. Vials contained filters loaded with yeast hybrid plasmid 
DNA and, as control, with vector plasmid DNA. Control ex- 
periments verified that hybridization was complete under these 
conditions when there was 0.3 ng or more of DNA on the filter 
and that even with the most abundant RNA there was a 20-fold 
excess of DNA on the filters. Filters were washed, treated with 
RNase, dried, and assayed for radioactivity in a scintillation 
counter for at least 20 min. 

Hybridizations were done in triplicate or quadruplicate. 
Values given in tables and figures are the difference between 
two means — (mean cpm on filters loaded with yeast/ vector 
hybrid DNA) — (mean cpm on filters loaded with vector 
DNA) — divided by the total acid-insoluble radioactivity. The 
background was about 40 cpm (filters with vector DNA gave 
results not significantly different from filters with no DNA); 
usually more than 10® cpm of acid-insoluble radioactivity was 
applied. 

RESULTS 

Isolation of Hybrid Plasmids in E. coli Carrying the Yeast 
Gene Specifying OMP Decarboxylase. Selection for function 
of the yeast gene specifying OMP decarboxylase was performed 
by using a nonreverting mutation in .the pyrF gene, which is 
the E. coli structural gene for OMP decarboxylase (24). A set 
of 2500 yeast/E. coli plasmids was constructed by insertion of 
randomly sheared pieces of S. cerevisiae DNA into the small 
E. coli plasmid pMB9 by the terminal-transferase method. The 
details of the construction of this set were described by Petes 
et al. (6); the probabihty that any particular yeast DNA se- 
quence is present in this set was estimated to be 0.85. Groups 
of 250 E. coli strains, each of which harbored a hybrid plasmid, 
were grown up and pooled, and the plasmid DNA was extracted 
by standard methods (25). Ten pools of yeast/£. coli plasmid 
DNA were used to transform the nonreverting E. coli pyrF-.Mu 
recipient. After a short period of growth in complete medium, 
the transformed bacteria were spread on agar plates containing 
no pyrimidines. Two of the 10 plasmid DNA pools yielded some 
Pyr"^ transformants; the 8 others yielded none. 



If the Pyr"*" phenotype is the result of the presence of a spe- 
cific yeast DNA sequence, then the property should be trans- 
ferable by transformation with plasmid DNA, whether or not 
selection for growth in the absence of pyrimidines is carried out. 
This was confirmed in two ways. First, the DNA from Pyr"*" 
transformants was used to transform again the pyrF-Mu strain, 
selecting for tetracycline resistance. All drug-resistant trans- 
formants acquired simultaneously the ability to grow in me- 
dium lacking pyrimidines. Second, another E. coli strain 
(HBlOl) which does not require pyrimidines was transformed 
to tetracycline resistance. Plasmid DNA was extracted from 
these transformants and tested for ability to transform the 
pyrFr.Mu strain to Pyr''". Again, all drug-resistant transformants 
yielded DNA with the ability to confer Pyr+ phenotype. Two 
hybrid plasmids, each deriving from one of the two successful 
DNA pools, were selected for further study; these are called 
clone 1 and clone 2. 

If the complementation of the pyrF defect by the DNA of 
clones 1 and 2 were due to the presence of yeast DNA speci- 
fying OMP decarboxylase, then the two plasmids should contain 
common sequences derived from yeast. In order to test this 
further, analysis of the two plasmid DNAs was carried out by 
using site-specific restriction endonucleases. The first results 
indicated that the two plasmids were different in length, and 
that neither was cut by Eco RI. However, both were cut several 
times by Hin dlll, yielding two fragments of identical mobility 
as judged by electrophoresis in agarose slabs (Fig. 1). The vector 
(pMB9) has only one Hindlll site. Differences in length be- 
tween the two plasmids were expected because the original 
hybrids were made from randomly sheared yeast DNA, and 
the endonuclease digestion results are consistent with the idea 
that the two plasmids are independent isolates of the same re- 
gion of yeast DNA. 

In order to try to determine which parts of the two plasmids 
contained the DNA responsible for the complementation of the 
pyrF defect, an experiment was carried out in which the Hin- 
dlll fragments were ligated into another £. coli plasmid vector, 
pBR322 (8). This vector carries genes conferring resistance to 
both ampicillin and tetracycline. The insertion of DNA into the 
single Hwdlll site on this vector causes loss of the tetracycline 
resistance (8). Hindlll digests of clones 1 and 2 were mixed with 
Hin dlll-digested pBR322 DNA and treated with T4 polynu- 
cleotide ligase. The Hgated DNA was used to transform £. coli 
pyrF::Mu and, as before, selection was made for independence 
of pyrimidines. Pyr+ colonies were tested for resistance to 




Fig. 1. Agarose/ethidium bromide electropherograms of plasmid 
DNAs. Samples of purified plasmid DNA (approx. 0.5 ^ig per lane) 
were digested with Hindlll and electrophoresed as described by Sharp 
et al. (11). Lanes labeled "Recloned" are plasmids obtained by re- 
cloning the Windlll digests of clones 1 and 2 into pBR322 followed 
by selection of Pyr+ after transformation of strain DB6656. 
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ampicillin and tetracycline; Amp'*, Tet*, Pyr+ colonies were 
found, purified by single-colony isolation, and grown up, and 
plasmid DNA was isolated from them. After digestion with 
Hin dm, each of these plasmids yielded two fragments, one 
with the mobility of pBR322, and one with the mobility of the 
larger of the two HindlU fragments (fragment b) common to 
clones 1 and 2 (Fig. 1). Thus, the ability to confer Pyr+ phe- 
notype could be narrowed down to a single fragment. Analysis 
of the length of this fragment in gels with 0X174 markers gave 
a result of about 1100 base pairs (data not shown). 

It was important to show that the complementing DNA in 
the hybrid plasmids was derived from yeast. The technique of 
Southern (13) in which DNA fragments displayed on an agarose 
gel are adsorbed to nitrocellulose paper and hybridized with 
labeled probes is ideal for this purpose. Several such experiments 
were carried out on EcoBI and Hindlll digests of total yeast 
DNA from two different strains of yeast, using as probe the 
hybrid plasmids described above. Fig. 2 shows one such ex- 
periment. The radioactive probe was clone 6 DNA made ra- 
dioactive by nick translation (12). Hybridization was observed 
at one position in the Hindlll digest of yeast DNA corre- 
sponding to the size of fragment b and at one position in the 
EcoRl digest of yeast DNA corresponding to a fragment larger 
than the sum of all the yeast Hindlll fragments of clones 1 and 
2. This result demonstrates that fragment b is a fragment of 
yeast DNA and that yeast DNA is responsible for the comple- 
mentation of the pyrF mutation in E. coli. 

Other Southern-type experiments, as well as standard re- 
striction enzyme analysis (not shown), permit the unambiguous 
construction of a restriction map of the region of the yeast ge- 
nome from which clones 1 and 2 derive (Fig. 3). These results 




Fig. 2. Hybridization of clone 6 DNA to total yeast DNA. Total 
yeast DNA (5 ng) from strains FLIOO (lanes 1 and 4) and +D4 (lanes 
2 and 5) was digested with EcoRl (lanes 1 and 2) or Hindlll (lanes 4 
and 5). Lane 3 contained 1 fig of Windlll-digested clone 1 DNA. The 
digested DNAs were electrophoresed through 1% agarose and trans- 
ferred to nitrocellulpse paper (13-15). "Nick-translated" (15) clone 
6 DNA was hybridized to the paper. The bands in lanes 1 and 2 indi- 
cate that clone 6 hybridizes to the same single £coRI fragment from 
both yeast strains and the bands in lanes 4 and 5 show that clone 6 
hybridizes to a single Hindlll band from both strands having the size 
of fragment b. 



BcoR\ b a EcoRl 
Yeast DNA ^W/ - I - , J //— i 

Clone 1 L- ^—7-r=-^pMB9M" 

Clone 2 " '■///////A -^ i — ^ ^^pMB9^ " I 

Fig. 3. Restriction map of yeast DNA in the neighborhood of the 
OMP decarboxylase gene. Straight arrows, Hindlll cleavage sites; 
horizonal lines, yeast DNA; crosshatched bars, pMB9 vector plasmid 
DNA; numbers, approximate lengths in kilobases. The data for 
identity of fragments a and b in clones 1 and 2 are from Fig. 1. Partial 
digests and Southern experiments like Fig. 2 yielded the map. 

show that the manipulations involved in constructing clones 1 
and 2 did not cause any gross rearrangement of the DNA from 
yeast and that two widely differing yeast strains to whose DNA 
the probe was hybridized (-HD4, from which the clones derive, 
and FLIOO, in which the ura genetics was done) are probably 
identical in this region of the genome. 

The observation that plasmids bearing Hindlll fragment b 
from yeast will allow a pyrF mutant of E. coli to grow in the 
absence of added pyrimidine makes it hkely that fragment b 
contains the information for the synthesis of yeast OMP de- 
carboxylase, because the pi/rF mutation results in the loss of only 
this enzyme. The gene that specifies OMP decarboxylase in 
yeast (4) is called ura3, and it thus appears that fragment b bears 
the uro3 gene. Two lines of evidence support this idea. First, 
extracts of £. coli pyrF::Mu bearing clone 1, 2, or 6 contained 
substantial levels of OMP decarboxylase (Table 1). Second, 
preliminary results with the transformation procedure for yeast 
recently described by Hinnen et al. (26) indicate that clone 6 
DNA will transform to uracil-independence a nonreverting 
«ro3 mutant of yeast (M. Rose and D. Botstein, unpublished 
results). 

Although clearly showing the presence of OMP decarbox- 
ylase activity resulting from the presence of the hybrid plas- 
mids, the data in Table 1 also show that the level of enzyme 
activity somehow varies with conditions of growth and that it 
depends upon the plasmid used. These variations are not yet 
understood but might indicate some kind of modulation of gene 
expression even in E. coli. 

For the present purpose, it suffices that fragment b is likely 
to contain all of the structural gene for yeast OMP decarbox- 
ylase and, from its small size (1100 base pairs), little else. Pre- 
liminary sequence information (D. Botstein and R. Tizard, 
unpublished data) is consistent with the idea that most of the 
1100 base pairs code for a protein. 

Regulation of Yeast OMP Decarboxylase Occurs at the 
Level of Transcription. Yeast OMP decarboxylase has been 
shown to be subject to induction by high intracellular levels of 



Table 1. OMP decarboxylase activity in E. coli 







Growth conditions 






Excess 


Limiting 


Host 


Plasmid 


uracil 


conditions 


E. coli pyrF 


None 


<0.001 


<0.001 




Clone 1 


0.10 


3.2 




Clone 2 


0.04 


1.1 




Clone 6 


5.9 


12.6 


E. coli pyrF''' 


None 


9.4 


42.7 



Cells were grown in M9 medium containing either excess uracil (40 
Aig/ml) or limiting uracil (1 //g/ml). When the cells had reached 
midexponential phase (excess uracil) or had starved for at least 1 hr 
(limiting uracil), they were harvested by centrifugation, resuspended 
in assay buffer, sonicated in the cold, and assayed for OMP decar- 
boxylase. Values are given as mmol of OMP destroyed per mg of 
protein per minute. 
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dihydroorotic acid (and possibly by orotic acid and OMP as 
well). In oral mutants (defective in dihydroorotase), dihy- 
droorotic acid accumulates. One finds in such mutants a level 
of OMP decarboxylase about 5 times greater than that in wild 
type; this level is only slightly affected if uracil is provided in 
excess (4). The best hypothesis for this induction is that enzyme 
biosynthesis is regulated, although alternative hypotheses have 
been difficult to rule out absolutely. 

The isolation described above of the DNA for the structural 
gene of OMP decarboxylase (i.e., fragment b in pBR322, clone 
6) made it possible to measure the level of uraS mRNA directly 
by hybridization. Wild-type yeast and an isogenic ural mutant 
(derepressed 5-fold) were grown in excess uracil so that the 
growth rates were comparable. Total RNA was labeled by ad- 
dition of C^Hjadenine to exponentially growing cells. After a 
short exposure (10-15 min) cells were either collected imme- 
diately or an excess of nonradioactive adenine was administered 
in order to observe mRNA decay. The chase became effective 
(i.e., (^HJadenine no longer was incorporated into acid-insoluble 
RNA) within 10 min. Equilibrium labeling of RNA was done 
over at least three generations under conditions that leave at 
least 75% of the radioactivity unincorporated. Hybridization 
was carried out at least in triplicate and OMP decarboxylase 
specific activity was determined in the same cultures. 

Table 2 shows the results of measurements of specific hy- 
bridization of RNA to fragment b (the uraS gene) in the two 
strains, one wild type and one induced; specific enzyme activity 
in the same cultures is also given. There was a comparable 
difference (a factor of 4-5) in the level of hybridization when 
labeling of the RNA was either in a relatively short pulse (10-15 
min) or over three generations (i.e., equilibrium). Other ex- 
periments (not shown) using different strains and different 
inducing conditions gave consistent results: the specific hy- 
bridization and the enzyme activity varied coordinately. Shorter 
pulse-labeling experiments (i.e., 2.5 min) also gave comparable 
differences, indicating that the differences in hybridization 
reflect differences in the synthesis of the RNA and not differ- 
ences in decay rate. 

In order to verify that the RNA that hybridizes to the clone 
6 DNA behaves like a normal yeast mRNA, the apparent 
half-life was measured and compared with RNA hybridizing 
to two other hybrid plasmids that were found to be comple- 
mentary to abundantly transcribed nonribosomal RNAs (6). As 
shown in Fig. 4, the ura5 mRNA had an apparent half -life of 
about 10.5 min, whereas the two putative "abundant messen- 
gers" had apparent half-lives of 7 and 14.5 min. All three 
mRNAs decayed exponentially with decay times within the 



1.0l 
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(S 

.~ 0.5 

;d 

a 0.4 



.2 0.3 




0.1 \ 

< . . . ^. . 
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Time after chase, min 

Fig. 4. Decay rates of yeast mRNA. RNA was pulse-labeled for 
10 min with [-'HJadenine followed by a chase of unlabeled adenine. 
RNA was extracted from samples taken at intervals thereafter and 
hybridized to clone 6 {uraS), clone PLl-BU ("abundant mRNA 1"), 
or clone PL13-H1 ("abundant mRNA2"). The latter two clones 
contain DNA complementary to unknown abundant unstable RNA 
and do not contain DNA complementary to ribosomal RNA (6). Hy- 
bridization is shown as a fraction of the total acid-insoluble radioac- 
tivity in the same volume of culture measured at each time point; this 
value increased less than 40% during the first 10 min of the chase and 
remained quite constant (increased slowly, less than 10%) thereafter. 
The range of values of cpm in ura 3 mRNA [(cpm on clone 6 filter) — 
(cpm on pBR322 alone); mean (± SD) of three filters each] was 220 
± 18 to 10 ± 3 cpm. ■, Abundant mRNA 1, ti/2 = 14.5 min; •, uraS 
mRNA, ti/2 = 10.5 min; ▲, abundant mRNA 2, £1/2 = 7 min. 

range (3-30 min) found for different yeast mRNAs under 
various conditions (27-30). 

DISCUSSION 

The techniques of in vitro recombination have made it possible 
to isolate and purify eukaryotic structural genes. In this paper 
we have described the isolation and identification of an 
1100-base-pair piece of yeast DNA that appears to contain the 
structural gene for yeast OMP decarboxylase, the ura3 gene. 
The gene was found by virtue of its expressibility in E. coli, a 
property previously described for various yeast genes encoding 
biosynthetic enzymes (2, 3). 

The availability of structural gene DNA in abundance made 
possible hybridization experiments to test directly whether the 



Table 2. OMP decarboxylase: mRNA and enzyme levels in induced and uninduced yeast strains 

Specific hybridization under different conditions^ 

Enzyme 15-min PL Equilibrium' 

activity* Exp. 1 Exp. 2 2.5-min PL Exp. 1 Exp. 2 

ura+ 3.8 2.4 2.0 2.2!* 0.22 ± 0.13 0.27 ± 0.14 

ural-21 19.4 10.7 9.2 9.4 0.88 ± 0.1 0.94 ±0.1 

Ratio 5.1 4.5 4.6 4.3 4.0 3.5 



Cells were exponentially growing in excess (50 Mg/ml) uracil in all cases. 

* Shown as >imol/min per mg. 

* PL, pulse label. Shown as fraction of total acid-insoluble •'H-labeled RNA X 10~^; in most cases, at 
least Vfi cpm was applied to each filter. 

' At three generations. These numbers are at the limit of significance; therefore, in each experiment 
four different filters were used for each sample. The filter-bound radioactivity was counted for 20 min 
and compared to that on four filters loaded only with vector (pB4322) DNA. Statistical analysis yielded 
accumulated errors (i.e., combined SD for blank and for sample) as indicated. Control experiments 
comparing vector blanks with filters lacking DNA showed no statistically significant difference. Other 
results in the table have computed statistical errors less than 10%. 

S A ura 2 mutant was used in this experiment; as shown previously (4), ura 2 mutants are not induced 
and have OMP decarboxylase levels identical to those of ura + strains. 
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regulation of the expression of the uraH gene occurs at the level 
of transcription. We found that the level of uraS mRNA as 
measured by hybridization to the structural gene DNA varies 
coordinately with the observed enzyme activity. It thus is pos- 
sible to conclude that the uraS gene is under transcriptional 
regulation. 

Measurements of the degree of expression of the uraS gene 
in E. coli have also been carried out. At the present time it seems 
clear that there is some variation depending upon the conditions 
of cell growth and upon the particular hybrid plasmid used. 
These variations could indicate that some kind of regulation 
might occur even in E. coli, but the nature of this variation is 
still not understood well enough to permit any firm conclu- 
sion. 

Related to this observation of variability in expression of the 
uraS gene in E. coli is the likelihood that the 1100-base-pair 
fragment of DNA contains a promoter recognized by E. coli 
RNA polymerase. At present there is no definitive proof of this 
and no indication as to whether the promoter used in E. coli 
bears any relationship to the normal promoter used by the 
relevant yeast RNA polymerase(s). 

We also measured the half -life of the uraS mRNA. The value 
we observed for this RNA (10.5 min) and for two abundantly 
transcribed RNAs (7 and 14.5 min) are in the range of published 
values for yeast mRNA decay rates (27-30). The variation 
among transcripts that we found is clearly significant, and the 
fact that these are direct measurements of particular chemical 
species should be emphasized. The fact that decay is exponential 
with little or no delay makes it somewhat less likely that the 
length of the poly(A) "tail" added to mRNA after transcription 
regulates its half-life. Nevertheless, such a hypothesis can now 
be tested because the two abundant messenger species measured 
in our experiments have such different decay rates. 

The amount of uraS mRNA we found in equilibrium label- 
ing in the wild type is about 2 X 10"^ of total RNA. Given that 
the total messenger comprises about 1.5% of total RNA, the uraS 
mRNA comprises about 1.3 X 10""'* of the total mRNA. If there 
are about 10"* genes in yeast, the uninduced level of transcrip- 
tion of uraS appears to be about average. 
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Fragments of hepatitis B virus DNA isolated from Dane 
particles have been inserted into the Escherichia coli 
plasmid pBR322 and cloned. Cells carrying the hybrid 
plasmid synthesise antigenic material that reacts 
specifically with antisera to hepatitis B viral antigens. 



Infection with hepatitis B virus (HB V) is widespread in man. 
Between 3 and 15% of healthy blood donors in Western Europe 
and the US show serological evidence of past infection and 
about 0*1% are chronic carriers of the virus. In many African 
and Asian countries the prevalence is much higher and the 
majority of the adult population have been infected, while 
5-10% of the population are chronically infected^ Most 
infections are subclinical and are followed by apparently 
complete recovery with the development of virus-specific anti- 
body. However, a significant proportion of infections (probably 
1-5%) may produce chronic sequelae including persistent 
infection, chronic hepatitis of various types, drrhosis and 
possibly primary liver cancer. 

Plasma from some blood donors and patients infected with 
HB V contains 42-nm spherical particles {Dane particles)^ which 
have serological and biochemical properties, suggesting that 
they are the infective virions of hepatitis B. These have an outer 
envelope containing the hepatitis B surface antigen (HBsAg) 
and an inner core (diameter 27 nm) bearing a second unrelated 
antigen, the hepatitis B core antigen (HBcAg). Within the core is 
a double-stranded circular DNA molecule of molecular weight 
-2x10* which has a large variable gap in one strand, and an 
endogenous DNA-dependent DNA polymerase activity that 
can fill in this gap in in vitro reactions^. There is some evidence 
that the total virus genome length may be around one-third 
greater than the 2x10* daltons found in single molecules m 
which case productive infection of a cell may require simul- 
taneous infection by at least two genetically different particles . 
A third antigen, the hepatitis B e antigen (HBeAg), which is 
probably also virus-coded, is found free in the plasma of some 
infected individuals and possibly also in association with Dane 
particles. Passively or actively acquired antibody to HBsAg 
(anti-HBs) confers some immunity to subsequent HBV chal- 
lenge, and prototype vaccines composed of purified inactivated 
HBsAg prepared from the plasma of infected carriers are being 
evaluated*. However, the virus cannot be grown in tissue culture 
and normally infects only man and apes. This means that 



* Present address: Division of Virology, I.M.V.S., Box 14, Rundle St. 
Post Office, Adelaide, S. Australia. 

t Permanent address: Max Planck Institute for Biochemistry, Am 
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molecular studies of the virus and its genome have been based 
on the limited amounts of material obtainable from the plasma 
of infected individuals. Such studies could be advanced 
considerably by insertion of HBV DNA into a bacterial plasmid 
or phage to allow its production in quantity from cloned, purified 
single molecules. Such clones would also be useful for studies of 
the expression of HBV gene products in bacterial cells, and for 
large-scale production of HBV DNA and viral antigens for 
diagnostic purposes and, possibly, vaccine production. 

Restriction of HBV DNA 

The published information on digestion products of HBV DNA 
(Dane particle DNA) with restriction endonucleases ■ is limited 
and so additional analyses of this type were carried out before 
attempting cloning experiments. The amount of HBV DNA 
available was limited (about 90 ng from 5 ml of plasma), so the 
DNA was first labelled with ^^P in the endogenous DNA poly- 
merase reaction^ DNA from bacteriophage A was then added as 
a carrier to titrate the restriction enzyme and to provide 
reference fragments of known sze. The digests were analysed by 
electrophoresis in agarose gels and the results of some of these 
experiments are shown in Fig. 1. 

The heterogeneity of the undigested labelled HBV DNA 
(tracks b and e) precluded a detailed analysis of the restriction 
digests. The multiple bands of rather similar size could be due in 
part to the presence of linear and circular forms of otherwise 
similar molecules, in part to differing degrees of repair synthesis, 
although repeated DNA preparations from the same plasma 
sample gave reproducible patterns; the heterogeneity could also 
represent a true molecular dispersity from a mixed population of 
vunons. HBV DNA preparations from several individual donors 
were all heterogeneous.and differed shghtly from each other (an 
example is included in Fig. 1 ; tracks a, b and c compared with d, 
e and fj both before and after digestion with various restriction 
enzymes. Heterogeneity has also been observed by Landers et 
al.^ in the products of the endogenous polymerase reaction 
which comprised two principal radioactive components 
representing linear and chcular molecules with additional minor 
components due to incomplete repair. Examination of the HBV 
DNA {Fig. 1, tracks b and e) by electron microscopy showed 
that the population contained circular molecules of MW~ 
2 X 10* and hnear molecules ranging from 0.5 to lOx 10 . The 
linear molecules represented about three tunes the concen- 
tration of drcular molecules and only a few of them had a MW 
around 2x10*, the majority being between 0.5 and 1 x 10*. The 
heterogeneity observed on electrophoresis was, therefore, not 
due to drcular and linear forms of similar length and few, if any, 
of the hnear molecules were labelled in the polymerase reac- 

i#<i^ Page #1836 
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HBsAg-posiiive Wood donors essentially as described by Landers « fl( .T^^^^ 30mCi ^Lmol"', respectively; Radiochemical Centte 
UbeUed'ln reactions with the '"dogenousDNA polymerase >^^^^^^ Jet were hither purified by sedimentation through 30W^'""TjSon 

Amersham'l or ^^P-dGTP (300 mCi ^ol'') were included . The released core P^nrcies we = k ^ . ^ j extraction and dialysis. RestricUon 
SK:isIlatedby.eatnten.withprotein»eK(^^^^^^^^ 

enzyme digests (37 °C. 1-5 h) were earned out ,n 10 f^J"*:"'^'' f "J were then applied to 1% w/v agarose gels" M T™ta^^^ 

m s-i „ o ner reaction) and the reactions were stopped by heating at /u u lor 3 mm. i J' . „_ jp, nv liEht" and then either placed in alkali to denature 
H S 2 tT«r"Xresis (35 niA, 8 h). Gels were sja^^^ 

DN A fragments for transfer to eeiluiose nitrate membrane filters "^"^ sts with R.£<:oRI and R.HmdIIl included m the same gel . These 

wasestiTedfromtheir^^ore.^^^^^^^ ,,,,s relative to each other. 



Digestion of HBV DNA with KEcorRl or R.BgflI changed 
thepatternof the major bands only slightly (Fig. 1 tracksc/and 
g) whereas digestion of the DNA with ^^J^^ ^J^^'?^^ 
together (track h) gave a radioactive fragment, MW O.l^xiu 
as the principal component, with little of the original DNA 
remaining. This is consistent with the introduction of a smgle 
break in circular molecules by either enzyme alone to give mtact 
Unear molecules; when the two enzymes acted together two 
fragments were formed, the smaller containmg the entire 
labelled region. On digestion with R.Kpnl (track 0 ahnost the 
whole Esroup of bands was displaced down the gel conespondmg 
with the loss from each of a fragment of MW - 0.4 x 10* but no 
corresponding radioactive fragment was found. This suggests 
that HBV DNA may contain at least two targets for R.Kp«i 
located outside the region repaired in the reaction with DNA 
polymerase and spanning a sequence common to all the mo e- 
cuUs. The alternative explanation requiring cleavage of circular 
molecules at a single target to give Unear molecules IS less likely 
in view of the behaviour observed in the R.EcoRl and R.Sgill 
digests. R.Haem furnished a spectrum of fragments with a 
range of sizes (track r) in a pattern broadly «mdar to Aat 
pubUshed' *. Digestion of HBV DNA with R.Aval 8i^TO|tAssi 



R.BamHI gave several radioactive fragments (tracks; and g), a 
major product of the R.Acal digest had a MW of 0.88 x 10 , 
while radioactive fragments of 1.2 x 10* and 1.8x 10* occurred 
in the R.BamHI digests; in other digests with R.BamHI smaller 
fragments were also observed. These results, together with the 
principal radioactive products found in various digests with paura 
of restriction enzymes (for example, R.fcoRI and R.Aval, 
track a or d) are summarised in Fig. 2. Withm the constraints 
imposed by the dispersity of the HBV DNA preparations used, 
they provided the approximation of relative positions of restnc- 

tion targets as shown. , - , . a^a 

The results imply that, in most molecules, the smgle-stranaea 
gap repaired by the endogenous DNA polymerase lies within a 
relatively constant region of the DNA sequence; Landers etat. 
similarly concluded, from an analysis of R.Haelll digests after 
different polymerase reaction times, that DNA repair ^ok place 
largely within the same one-third to one-half of the total DNA 
secTuencc. although initisttion o£ DNA repair could occur at 
variable sites within this region. d q nt the 

Digests of HBV DNA with R.£«RI or R.BgflI offer the 
possibUity of cloning the entire HBV genome, ^hde map 
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or R-Aual, or from various double digests. For structural studies 
of the HBV genome it is desirable to clone the entire DNA 
molecule, but clones covering a range of fragments could well be 
more useful for attempts to demonstrate the expression of HBV 
sequences in E. coli. Micromethods based on radioimmunoassay 
can be apphed to individual bacterial colonies or phage plaques 
for the detection of specific polypeptides^ As many eukaryotic 
genes will not be expressed in prokaryotic cells it appeared 
desirable to insert HBV DNA fragments within & prokaryotic 
gene so as to produce a fused polypeptide; for this purpose the 
R.Pjf target in the E. coli plasmid pBR322 has been used 
successfully*. 

Cloning of HBV DNA fragments in pBR322 

HBV DNA isolated from Dane particles from a single HBsAg 
positive, HBeAg positive dongr (serotype adyw) was labelled to 
a low specific radioactivity with ^H in a repair reaction with the 
endogenous polymerase in order to facilitate its handling. This 
preparation was then variously digested with R.£corRI, 
R.BamHI, R.Sg/Il, R.ApnI and R.Atjal. Portions of the 
R.EcoRI and R.BamHI digests were used for insertion at the 
respective sites of appropriately restricted pBR322 DNA by 
annealing and ligation with T4 DNA ligase. The fragments from 
the other restriction enzyme digests and the remainder of the 
R.£coRI and R.Bam digests were treated with polynucleotide 
terminal transferase for addition of 3' oligo (dC) sequences'. 
These fragments were annealed to pBR322 DNA to which oligo 
(dG) sequences had been attached after cleavage with R.Pst. 
The DNA preparations were then used to transform competent 
cultures of E. coli HBlOl and transformants were screened for 
the acquisition of HBV sequences on the following basis. Cells 
transformed with recombinants made using the R.Pst site in 
pBR322 retained their resistance to tetracycline, but became 
sensitive to ampidllin. Cells transformed with recombinants 
made using the R.BamHI site in pBR322 became sensitive to 
tetracycline, but retained their resistance to ampicillin, while 
cells transformed with DNA cloned using the R.EcoRI site 
remained resistant to both antibiotics"' and were detected by 
colony hybridisation" with '^P-labelled DNA from Dane 
particles. The presence of HBV DNA sequences in colonies 
where antibiotic resistance and sensitivities indicated the 
insertion of additional DNA into the plasmid was confirmed by 
colony hybridisation. 

Characterisation of pBR322-HBV hybrids 

Plasmid DNA isolated from cell cultures that had been treated 
with chloramphenicol to amplify plasmid production^^ was 
analysed by gel electrophoresis before and after digestion with 
restriction endonucleases. 

DNA fragments from several of the gels were transferred to 
cellulose nitrate filters for hybridisation" with HBV DNA 
labelled with ^^P by the endogenous polymerase reaction. 
Examples of these results are shown in Fig. 3 and some of the 
characteristics of the cloned segments are given in Table 1. The 
hybridisation observed with appropriate fragments does not 
establish conclusively that the cloned sequences were HBV- 
specific, for the cloned DNA and "P-labelled probe had been 
prepared from the same plasma sample. Thus contaminating 
non-viral DNA fragments that had been cloned inadvertently 
would also have been detected if the same non-viral DNA 
sequence was labelled by the endogenous polymerase ; however, 
in other experiments such '^P-labelied probes hybridised with 
DNA from HBV-infected liver tisue, but not with DNA from 
normal human liver. Furthermore, preparations of "P-labelled 
DNA made from Dane particles purified from four different 
blood donors gave the same patterns when hybridised against 
restricted DNA fragments from the recombinant plasmids, thus 
strengthening the view that the cloned sequences were in fact 
HBV DNA. 

The results presented in Fig. 3 are examples taken from the 
large number of recombinant colonies obtained from the trans- 
formation experiments and relate to colonies descrilBi|^<;!Sf§s 
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Tib. 2 The size and approximate relative order in the HBV genome of some 
of the major fragments in various restriction enzyme digests (Fig, 1) of HBV 
DNA. The site for R.EcoRI is taken arbitrarily as a reference point for the 
circular map. The shaded area indicates the region labelled with in the 
endogenous repair reaction. 

Expression of HBV sequences 

A population of the colonies canying putative recombinant 
plasmids, selected on the basis of drug resistance and sensitivity 
characteristics, was screened for the production of HBV 
antigens. Three test systems were used in the solid phase method 
of radioimmunoassay described by Broome and Gilbert' which 
uses polyvinyl disks coated with IgG from specific antisera. 
These disks are placed in contact with cells producing antigen 
which binds to the IgG surface. When subsequently incubated 
with "'I-labelled IgG from the same antiserum, the bound 
antigen retains the label and can be detected readily by 
autoradiography. The sera used were anti-HBs (human and 
hyperimmune animal sera), anti-HBc (human sera containing 
HBsAg and HBeAg and selected on the basis of a high anti-HBc 
titre) and anti-HBc and anti-HBe together (human sera 
containing HBsAg and high levels of anti-HBc and anti-HBe). 

Clear positive results were obtained in the two test systems 
containing anti-HBc. Of some 350 colonies tested, 13 gave 
intense spots on autoradiography with the anti-HBc + anti-HBe 
antibodies (Fig. 4a). Most of these colonies remained strongly 
positive when re-tested in both the anti-HBc and anti-HBc + 
anti-HBe assay systems (Fig. 4*). None of the clones giving a 
negative response in the initial screening was subsequently 
positive, but some clones that were positive initially gave a 
negative result after subculturing, which probably reflects 
instabihty of some of the hybrid plasmids. All of the clones that 
were positive in the anti-HBc + anti-HBe system were positive 
when tested with anti-HBc alone, implying that none was 
producing detectable levels of HBeAg. If lysis of the bacterial 
colonies with phage A before radioimmunoassay was omitted, 
only a very faint outline of the positive colonies was discernible 
(Fig. 4c). This is consistent with the presence of the antigen as a 
periplasmic polypeptide fused to the major part of ^-lactamase 
(penicillinase) as anticipated. The sensitivity of this assay was 
shown to be comparable with that attainable with the same 
reagents in a solid phase microtitre well assay" by titration of 
10 (1,1 samples (diluted progressively from 1 in 400) of semi- 
purified HBcAg from human liver". 

The immunological specificity of the reactions was confirmed 
by the following observations. Replicate assays with different 
anti-HBc sera identified the same positively reacting clones. The 
positive reactions in the HBcAg assay were abolished if the 
polyvinyl disks were coated with normal human IgG instead of 
specific anti-HBc IgG, or if the normal human serum used as 
diluent for the "^I-labelled antibody was replaced by anti-HBc- 
positive serum from a different donor (by competition with 
excess unlabelled anti-HBc), or if "'I-anti-HBS replaced '"l- 
igf^ifHRfciE-tfee^assay with the anti-HBc coated disks. Finally, 
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Fig. 3 Electrophoretic separation in agarose gels 

of restTiction enzyme digests of recombinant a b i b- c d ^ j 

plasmids comprising pBR322 and HBV DNA 
sequences. The examples shown are of plasmids 
(Table 1) that elicit production of antigenic 
material that reacts specifically with antibodies to 
HBcAg (Fig. 4). The left-hand panels show DNA 
fragments revealed by staining with ethidium 
bromide and photography under UV light. After 
photography, gels were soaked in alkali for dena- 
tuiation of the DNA fragments which were then 
transferred to cellulose nitrate membrane filters" 
for hybridisation with "P-labelled HBV DNA 
and autoradiography; the right hand panels show 
the corresponding radioautographs. HBV DNA 
(-0.5 (ig) labelled with was prepared from 
40 ml clarified blood plasma from a donor with a 
high titre of HBsAg, serotype adyw as described 
in the legend to Fig. 1 and ref. 6. In one experi- 
ment 0.5 ikg E. coU DNA was added as a carrier, 
but in a second the carrier was omitted. The DNA 
wasdividedmtos.xt«rt^^^^^^^ 

Lmainder of tlii R.Ec/rI aniR.BamHI digests as well as R.Kpnl and R.Bg/If ^'ifts were used in termma trandera^ w,th the e.«pt on of the 





R Z"*Best1L"sampUs°w;«^^^ A exonuclease (L5 h at"0»C in 50 mM Na glycinate. pH9.5. 5 mM MgCl,, 50 ^em|-' bovine serum 

l^t?BlA) tallowed ^^^^ 

enzvmes Poly dOseauences were attached to the 3' termini by incubation with polynucleotide termmal transferase' (250 U ml 'for 10-20 mm at 27 Q 'n 15 ul 
ToSpoUs^ium Slate^ pH 7.0, 1 mM CoCl.. 1 mM dCTP. 50 ml"' BSA. and the reactions stopped^addition of excessEDTA. An »PP«^.mately molar 
eauiv^deronBR3?2 DNA d gested ^ith R.Psr and incubated in similar reactions with dGTP instead of dCTT (from Dr J. van den Berg) was then added and the 
in 50 mM Tris-HCl, pH 7.5, 100 mM NaCl. 5 mM EDTA, heated to 60 °C and gradually cooled to room temperature over a 
feriod^f about 5 h AHquots of the solutions {10 1.1) were then incubated with competent cultures (0. 1 ^1) of E. coli HB 101 prepared as described by Lederberg and 
Coren"andir^™bat1!lte 

filtei^ supported on appropriate agar plates to test their resistance or sensitivity to the two antibiotics and tor colony hybridisation with P;"^ ™V DNA 
from Dane particles. Colonies ^ving positive hybridisation reactions were grown in liquid culture and then shaken overnight at 37 C a^^er addition of 
cXamXnicol (170 aa ml"') to amplify the number of plasmids within the cells which were then collected by eentrifugation and treated with lysozyme and 
EDTA^f:^ r sl^^S^^ 

Sntaining ethidium bromide (200 ^g ml"). The plasmid bands were collected, extracted with propan-l-ol saturated with aqueous GCI dialysed. and samples 
CteTwVhrstrictioncndonUa:f3asdescribedintheiegend.oFig.l.Theresultsshownaretromtheto«^ 

in which a is the undigested plasmid and i, c and d are digests with R.P^t R./Tp/iI and R,B<imHI, respectively ,pHBV-l 1 0 in tracks ^. /, g and A, m whuch e s he 
undigested plasmid and /, g and h are digests with K.PsU R.BamHI and R.B<,mHI+ R.£e.RI, respectively; pHBV-139 in tracks , k. I m and «. m which , is the 
uSale ted p Lmid and i ? ™ and ™ are digests with R.P«, R.B«mI, R.£«RI, and R.S«mHI + R^EcoRI. respectively. Tracks , and p show reference digest 
DNA with R.E«RI + R.Hmdm- which also contained "P-labelled undigested DNA from Dane particles. The Pr.'ned ietters identify the sampks on the 
autoradiograph, made on Kodak X-omat H X-ray film with an intensifying screen, of the cellulose nitrate filters after hybndisation with HBV DNA. 



absorption of the radioactive anti-HBc with semipurified 
HBcAg^^, by overnight incubation at 4 "C followed by eentri- 
fugation to remove the excess antigen, abolished the positive 
result in the radioimmunoassay. Thus the detection of an as yet 
unidentified HB V-specific antigen by interaction with its specific 
antibody present in the sera of HBV-infected individuals 
remains a formal possibility, but it is unlikely, and the results are 
wholly consistent with the observed activity being that of 
HBcAg. The HBcAg polypeptide detected in the colonies, 
however,, is unlikely to be itientical with that occurring naturally 
for the cloning experiment was such as to produce polypeptides 
linked to jS-lactamase*. However, it has not been established 
that this is the case and it remains possible that translation of 
j3- lactamase sequences could be terminated and followed by 
reinitiation, but against this is the occurrence of clones that do 
not exhibit HBcAg activity, but which contain HBV DNA 
fragments similar to those that do. The point will be clarified by 
DNA sequence determination, which is in progress. All of the 13 
HBcAg-positive clones had been made using the R.Pst site in 
pBR322, five with fragments from R.BamHI digests of HBV 
DNA and eight from R.ApnI digests. Not all of the plasmids 
have been isolated tor analysis, but the three illustrated in Fig. 3 
were all from cells giving positive reactions for HBcAg. The 
smallest fragment of HBV DNA in these plasmids (Table 1) was 
about 0.95x10* daltons, but others with an HBV fragment 



about half this size also gave positive reactions for HBcAg; 
values reported^ for the MW of the naturally occurring HBcAg 
range from 17,000 to 80,000. 

In an equivalent assay for HBsAg, similar, but faint positive 
reactions were obtained with four clones which are being 
analysed further. One might expect detection of expression of 
serological activity to be more difficult whh HBsAg than with 
HBcAg as its protein moiety is markedly hydrophobic and hence 
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TaJile 1 Some properties of the plasmids shown in Fig. 3 



Hybrid plasmid 

pHBV-66 

pHBV-110 

pHBV-139 



MW of fragment 
excised by R.Fst 
xlO"* 

1.2 

0.95 

1.16 



Targets tor restriction enzymes 
within the HBV sequences 
R.£coRI R.SamHI R.^lual 



In all the plasmids, the site for R.BamHI within the HBV sequence is located 
about 0.7x 10* daltons from the R.Psi site near the R.EcoRi site. Board Assigned Page #1 839 



Fig. 4 Autoradiographs showing the detection by radioimmunoassay' of 
bacterial colonies expressing HBcAg. a. Four HBcAg-positive colonies 
amongst 52 colonies examined on one plate, t. The resuk obtained when 
initially positively reacting colonies, together with a random selection of 
colonies giving a negative reaction, were subcultured from stock plates and 
re-tested, c, A duplicate of b in which lysis of the colonies with phage A 
before radioimmunoassay was omitted. Colonies of the bacteria were grown 
at 37 °C overnight on Millipore filters supported on nutrient plates contain- 
ing tetracycline {to maintain selection for the plasmid). The ceBs were lysed 
by imprinting the filters for a few minutes on a lawn of bacteria confluently 
lysed with a virulent derivative of phage A and then incubating further for 
several hours at 37 °C. Polyvinyl disks coated with anti-HBc -I- anti-HBe 
specific human IgG (60 iigrol"') were placed face down on the colonies, 
which had obviously lysed, and incubated at 4 °C for 3-4 h. The disks were 
then removed and washed thoroughly and vigorously to remove the 
considerable quantity of adherent viscous bacterial debris. Finally the disks 
were incubated overnight at 4°C with homologous '"I-labelled IgG 
(10* c.p.s. per (ig, 2 X 10" c.p.s. per ml), washed thoroughly and autoradio- 
graphed on Kodak Blue X-ray film exposed for 2 d with an intensifying 
screen. 
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tends to remain associated with lipid", and some experiments 
(but not otliers) suggest that carbohydrate residues on HBsAg 
glycoprotein may be required for full serological activity ' . 

Conclusions and further implications 

A large number of hybrid DNA molecules comprising pBR322 
and various fragments of the HBV genome have been cloned 
and propagated in E. coli. It thus becomes possible to produce 
HBV DNA in the quantities required for detailed structural and 
sequence analysis and location of the various coding sequences 
in the viral genome. Such analysis with DNA from several 
independent isolates and clones will explain the basis of the 
heterogeneity found in DNA from Dane particles. The DNA 
will also be useful for further genetic manipulation related to 
studies of expression and as a source of a highly radioactively 
labelled probe for hybridisation experiments for both diagnostic 
purposes and fundamental studies. 

When inserted within a normal coding sequence of the E. coli 
plasmid the DNA from hepatitis B virus, which normally infects 
only man and apes, can be expressed to give serologically active 
translation products. This suggests that at least the region of the 
HBV genome coding for the amino acid sequence necessary for 
serological activity probably does not contain inserted 
sequences, or 'introns', which have now been found in a number 
of eukaryotic and viral genes''"^^, for it is widely beUeved, 
although not proved, that E. coli cannot process transcripts of 
these sequences. None of, the hybrid plasmids so far examined 
contains a target for R.Eco^l within the HBV sequences. It was 
thus possible to insert the entire plasmid into a derivative of 
bacteriophage A using this restriction site and plaques of the 
recombinant phage gave positive reactions in the disk radio- 
immunoassay (results not shown). The HBV sequences may well 
prove more stable when propagated within the phage genome, 
especially as a lysogen, and appropriate exploitation of the 
phage regulatory systems should permit significantly increased 
yields of the antigens from E. coli, raising the possibility of large 
scale antigen production for diagnostic purposes and develop- 
ment of vaccines. The insertion of HBV DNA into a phage^A 
derivative has been described very recently by Fritsch et al. . 
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A 3-s delay in an optical burst from 
X-ray burst source MXB1735-44 

The discovery of optical bursts from the X-ray burst source 
MXB1735-44 was reported in a previous articled One burst 
event was detected in both X rays and optical light. We showed 
(in Fig. 1 of the eariier article) that the optical burst is delayed by 
=1.5 s relative to the X ray burst. We concluded that to within 
the accuracy of the SAS 3 quick-look data timing (±1 s), the 
onset of the optical burst was coincident with that in X rays. A 
few months after the completion of our earlier paper, SAS 3 
production data became available. We here report a re-analysis 
of the arrival times of the X ray and optical bursts based on these 
data which contain the universal time to an accuracy of s5 ms. 
We now find {as shown in Fig. 1) that the optical burst is delayed 
significantly by 2.8 s. 

The optical observations were performed using the 1.5-m 
telescope at the Cerro Tololo Interamerican Observatory anda 
red-blocked RCA 31034 photomultiplier tube which had fflPan 



effective bandpass including the transmission of the atmosphere 
of 3,100-5,500 A (FWHM). The absolute time, which was 
maintained to within 10 ras of ur by monitoring WWV, and the 
optical data in 100-ms integrations were recorded on a digital 
tape. Simultaneous X-ray observations were made with the 
horizontal tube detectors aboard the SAS 3 observatory^. The 
three energy channels of interest are 3-6 keV, 6-12 keV and 
8-19 keV which have time resolutions of 0.42 s, 0.83 s and 
0.83 s, respectively. The X-ray production data are assigned 
absolute times in a three -step process. First, real time X-ray data 
and timing data from the spacecraft clock are tagged with the 
absolute time at the NASA ground station in Quito, Ecuador, by 
recording them simultaneously with the output of a UT caesium 
clock. Second, in the production analysis at Goddard Space 
Flight Center the real time data are used to assign UT to the bulk 
of the data which are played back from the on-board tape 
recorder. Third, corrections for the variable VHF propagation 
time from the satellite to the ground station are made using a 
da4sa^ep|h|aia|SS#J^S«rt)on tout days of radar-Doppler track- 
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ent with the binaural estimates for the 
same stimulus conditions. 

Although our measurements represent 
only a limited investigation of higher-fre- 
quency combination tones, we feel con- 
fident in concluding that, at least for low 
primary frequencies, such combination 
tones exist 20 to 40 dB below the primary 
levels. As others have suggested (2, 8), 
the difficulty in identifying their presence 
appCcirs to be largely due to the upward 
masking by the primary tones. Our ob- 
servations indicate that the relative level 
of higher-frequency combination tones 
can be quite high. Thus, even though 
they are not clearly audible, they may be 
of some importance in identifying the 
nonlinear mechanism from which they 
arise. In this respect, the presence of 
higher-frequency combination tones 
brings the psychoacoustic results into 
closer agreement with physiological in- 
vestigations of cochlear microphonic in 
which distortion components above and 
below the primary frequencies are rou- 
tinely measured (14). 

P. M. ZUREK 

R. M. Sachs 
Central Institute for the Deaf, 
818 South Euclid, 
St. Louis, Missouri 63110 
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Human Growth Hormone: Complementary DNA 
Cloning and Expression in Bacteria 

Abstract. The nucleotide sequence of a DNA complementary to human growth 
hormone messenger RNA was cloned; it contains 29 nucleotides in its 5' untrans- 
lated region, the 651 nucleotides coding for the prehormone, and the entire 3' un- 
translated region (108 nucleotides). The data reported predict the previously un- 
known sequence of the signal peptide of human growth hormone and, by comparison 
with the previously determined sequences of rat growth hormone and human cho- 
rionic somatomammotropin, strengthens the hypothesis that these genes evolved by 
gene duplication from a common ancestral sequence. The human growth hormone 
gene sequences have been linked in phase to a fragment of the trp D gene o/ Esche- 
richia coli in a plasmid vehicle, and a fusion protein is synthesized at high level 
(approximately 3 percent of bacterial protein) under the control of the regulatory 
region of the trp operon. This fusion protein (70 percent of whose amino acids are 
coded for by the human growth hormone gene) reacts specifically with antibodies to 
human growth hormone and is stable in E. coli. 
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Growth hormone, along with at least 
two other polypeptide hormones, cho- 
rionic somatomammotropin (placental 
lactogen) and prolactin, forms a set of 
proteins with amino acid sequence ho- 
mology and to some extent overlapping 
biological activities (7, 2). Since the 
genes of this set of proteins probably 
have a common ancestral origin (/), they 
constitute an excellent model to study 
the evolution, structure, and differential 
regulation of related genes. In addition, 
smce human growth hormone is of con- 
siderable medical importance and its 
supply is limited, the synthesis of growth 
hormone in bacteria might provide the 
required alternate source of this critical 
hormone. 

We have previously isolated and ana- 
lyzed bacterial clones containing copies 
of complementary DNA (cDNA) tran- 
scripts of messenger RNA's (mRNA's) 
for these hormones. The complete se- 
quence of rat pregrowth hormone 
mRNA (J) has been reported; in addi- 
tion, sequence data have been presented 
for fragments of about 550 bases com- 
plementary to part of the coding (amino 
acid residues 24 to 191) and 3' untrans- 
lated portions of human chorionic so- 
matomammotropin (hCS) (3, 4) and hu- 
man growth hormone (hGH) mRNA's 

(5) . A partial sequence of rat prolactin 
has been determined by Gubbins et al. 

(6) . These sequence data showed that, 
whereas the growth hormone genes of 
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the rat and man had significant homolo- 
gy, they also had diverged substantially, 
such that they differed more than the 
genes for the functionally distinct human 
hormones hCS and hOH. 

We now report the synthesis, cloning, 
and sequence analysis of cDNA contain- 
ing the entire coding and most of the 
noncoding portions of hGH mRNA. We 
also describe the insertion of these se- 
quences into an "expression plasmid" 
containing part of the Escherichia coli 
tryptophan {trp} operon whose construc- 
tion has been realized by Hallewell and 
Emtage (7). We describe the use of this 
plasmid to promote the inducible bacter- 
ial synthesis of high levels of a hybrid 
protein, 70 percent of which is composed 
of amino acids coded for by the hGH 
gene. 

Human growth hormone mRNA isola- 
tion. Polyadenylated RNA was isolated 
(8) from human pituitary tumors re- 
moved by transphenoidal hypophysec- 
tomy. To obtain an indication of the in- 
tegrity and the relative abundance of 
growth hormone mRNA in each sample, 
the individual mRNA preparations were 
translated in the wheat germ cell-free 
system, and the products were analyzed 
by electrophoresis on sodium dodecyi 
sulfate-polyacrylamide gels (Fig. 1). 
Among the translation products of the 
five acromegalic tumor RNA's {Fig. 1, 
lanes 1 to 5), the most prominent band 
corresponds to a protein of approximate- 
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ly 24,000 daltons. This protein is as- 
sumed to be human pregrowth hormone 
since it is similar in size to rat pregrowth 
I hormone (Fig. 1, "rat") and is precipi- 
,' tated by antiserum to hGH (data not 
shown). This assumption is further justi- 
1 fied by comparison with the translation 
products of polyadenylated RNA isolat- 
1 ed from bovine pituitary (Fig. 1, "cow") 
[ and from a human prolactin-producing 
) tumor (Fig. 1, lane 6). Both of these 
RHA's directed the synthesis of a pro- 
tein similar in size to human and rat pre- 
growth hormone, but also directed the 
synthesis of a larger quantity of a protein 
. of higher molecular weight, presumably 
preprolactin. The tumors show variation 
in the extent to which hGH mRNA is 
present, as measured by their trans- 
lational activities. Nevertheless, hGH 
mRNA appears to be the most abundant 
1 mRNA species in the acromegalic tu- 
I mors (Fig. 1, lanes 1 to 5). These results 
are consistent with and verify the clinical 
diagnoses made prior to surgery. 

Molecular cloning ofhGHcDNA. The 
polyadenylated RNA from the tumors 



that appeared to have the greatest abun- 
dance of hGH mRNA by the trans- 
lationa! assay were pooled for synthesis 
of double-stranded cDNA (Fig. 2). Por- 
tions of the double-stranded cDNA were 
analyzed by restriction endonuclease di- 
gestion before and after treatment with 
SI nuclease. A high proportion of the 
cDNA was about 1000 nucleotides long 
(Fig. 2, lanes a and b), the length ex- 
pected for hGH mRNA, assuming analo- 
gy with rat pregrowth hormone mRNA 
(5). Endonuclease Hae III digestion of 
the DNA generated a 550-base pair (bp) 
fragment (Fig. 2, lanes h and j) pre- 
viously reported to occur in hGH and 
hCS cDNA's {4, 5). The prominence of 
this band supports the idea that the 
cDNA is highly enriched in hGH gene 
sequences. This is further suggested by 
the finding of a fragment of about 400 bp 
generated by digestion with Hinf I and 
Sma I (Fig. 2, lane g). The fragment of 
about 500 bp generated by Pvu II (Fig. 2, 
lanes e and i) extends beyond the pre- 
viously cloned 550 bp fragment, which 
contains only one Pvu II site. However, 



its presence is predictable from the se- 
quence of rat growth hormone cDNA 
(i), and by the conservation between 
species of the aniino acid sequence in 
this region (9). The fragments of about 
350 bp and 150 bp generated by com- 
bined digestion with Pvu II and Bgl II 
(Fig. 2, lane f) would also be anticipated 
from the previously determined struc- 
ture of the 550 bp hGH fragment and 
knowledge of the existence of the addi- 
tional Pvu II site. Therefore, this cDNA 
preparation appears to be highly en- 
riched in full-length copies of hGH 
mRNA. 

The uncleaved cDNA was cloned in 
the plasmid pBR322 and£. coli xl776 in 
a P3 physical containment facility {10) by 
methods similar to those previously de- 
scribed (J). Briefly, the cDNA was first 
treated with SI nuclease and sub- 
sequently with DNA polymerase I in the 
presence of the four deoxynucleoside 
triphosphates to generate blunt-ended 
cDNA molecules. Synthetic DNA con- 
taining the site for the restriction endo- 
nuclease Hind III was then added to 





Human 

Fig. 1 (left). Translation products of mRNA isolated from growth hormone and prolactin-producing tumors and from bovine pituitary. These tis- 
sues were stored in liquid nitrogen shortly after their removal until preparation of the RNA. A portion of polyadenylated RNA isolated from each 
human tumor, the bovine pituitary, and the cultured rat pituitary tumor (GC) cells was used as a messenger in the wheat germ cell-free protein 
synthesis system (30). The ^'S-labeled proteins were analyzed by electrophoresis and autoradiography on sodium dodecyl sulfate-polyacrylamide 
gels (12.5 percent) (20). The translation products from five acromegalic and one prolactin-secreting tumor are shown in lanes 1 to 5 and lane 6, 
respectively. The lanes labeled "cow" and "rat" show the translation products from bovine pituitary and rat GC ceil RNA. Lane 7 shows the 
bacteriophage T4 proteins (2/) used as molecular weight markers. The arrows indicate the bands corresponding to pregrowth hormone (pGH) and 
preprolactin (pPL). Fig. 2 (right). Analysis of cDNA synthesized from mRNA extracted from the growth hormone-producing pituitary tumors. 
The polyadenylated RNA from tumors two, four, and five were pooled (135 /ig) and used as a template for synthesis of ™P-labeled double- 
stranded cDNA as described (i). Samples of this cDNA were cleaved with various restriction endonucleases before and after SI nuclease diges- 
tion. The figure shows an autoradiogram of the resulting DNA fragments after electrophoresis on a 4.5 percent polyacrylamide gel (i). (Lane a) 
Uncleaved cDNA before SI digestion; (lane b) uncleaved cDNA after SI digestion; (lane c) bacteriophage fd DNA, Hpa II (molecular weight 
markers); (lane d) cDNA, Pst I + Bgl II; (lane e) cDNA, Pvu II; (lane f) cDNA, Bgl II + Pvu II; (lane g) cDNA, Hinf I + Sma I; {lane h) cDNA, 
Hae III; (lane i) cDNA, Pvu II; (lane j) cDNA, Hae III; Gane k) bacteriophage fd DNA, Hae III (molecular weight markers). 
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each end of the cDNA, and cohesive 
ends were generated by digestion with 
endonuclease Hind III. The resulting 
cDNA was purified on gel and ligated to 
Hind Ill-cut and bacterial alkaline phos- 
phatase-treated pBR322 plasmid DNA 



(5). Bacteria were then transformed with 
this recombinant plasmid. Colonies with 
recombinant DNA containing plasmids, 
selected by antibiotic resistance (tet^amp^) 
were grown, and the plasmid DNA's 
were isolated. The DNA was digested 



with Hind III, treated with various 
Other restriction endonucleases, and 
analyzed by gel electrophoresis. One 
clone contaftied an insert of about 800 bp 
whose digestion by Hae III, Pvu H, and 
Bgl II generated fragments similar in size 
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Fig. 3. Nucleotide sequence of hGH mRNA and the amino acid sequence of human pregrowth hormone. The sequence was determined according 
to the procedure of Maxam and Gilbert (,11) from 5'- or 3'-end-labe!ed restriction fragments of chGH800/pBR322. The sequence between the two 
internal Hae III sites was taken from our previous work (5). Most of it, and all of the sequence outside these two internal Hae III sites was 
resequenced by the chain-termination technique (72) as described in detail elsewhere (22) using as single-stranded templates the chGH800 
Hind III fragment recloned in the vector M13 mp5 and as primers restriction fi-agments of chGH800/pBR322. The RNA sequence has been taken 
from the DNA sequence. The amino acid sequence has been deduced from the RNA sequence using the genetic code. The termination codon, 
UAG, is designated by the symbol AM for "amber." 



TRANSLATION 

HUMAN GSOWTH HORMONE 
tlsD IjigdIIIUNKtR -26 -25 

Glu Ala Pro Ser teu Gly lie Mst- Ala 

5' — ^l^lfefciaaSCi' CCA I AGC TIG GOG *TC , At© 6CT— 3 
-Cft CftA- GGT TCG aUc CCt WW- -.■;, .*•«, COA, 



I , ^ 

(po) La t!aE tlBPy / HGH T6t_ 

^-^--^ 

liiBdIII BginHI Pstl Hind 111 BgmHI 

Fig. 4. Codon usage in hGH mRNA. The numbers indicate how many 
times the codons are used in the region of hGH mRNA coding for the 
prehormone; OC, OP, and AM designate the stop codons ochre, opaS. 
and amber, respectively. Fig. 5. Postulated nucleotide sequence 
around the Hind III site in the hybrid gene of expression plasmid 
prrpEDSO-chGHSOO. Plasmid pf;-pED50 was constructed from pfrpED5-l (7) by linearizing the plasmid with Hind III, filling in the protniding 
5' ends with the use of DNA polymerase I (Klenow fragment from Boehringer) and ligattng synthetic decamers containing a Hind III site (col- 
laborative research) to the blunt-ended material (3, S). After digestion with Hind III the plasmid was separated from residual linker molecules by 
chromatography on Sephadex G-200, recircularized with T4 DNA ligase, and used to transform £. coU W3110trpoETlC2i) by a standard proce- 
dure (24). Plasmid DNA isolated from one of these colonies was digested with Hind HI and treated with alkaline phosphatase (4, 8). A portion 
{5 /ig) of this DNA was end-labeled with [y-^Pl-ATP with the use of T4 DNA kinase (Boehringer) and cut by Hae III. The DNA sequence of the 
labeled fragments was determined by chemical cleavage (Jl), after they were isolated by polyacrylamide electrophoresis. The cloned chGHSOO 
DNA was cleaved from pBR322 with Hind III and isolated by polyacrylamide ge! electrophoresis. This DNA was ligated to similarly cleavesl 
and alkaUne phosphatase-treated prrpED50. The ligation mixture was used to transform E. coli strains W3110 r/poEVl and RRl (25) in a K 
facility, and transformants resistant to ampiciUin were selected. Resistant colonies were examined for the presence of Inserted chGHSOO 
sequences by gel analysis of such plasmids after digestion with restriction endonucleases Bam HI and Pst I. Plasmids with the growth hormone 
initiator codon proximal to the f/pD gene sequence showed bands of 250 and 900 bp, whereas plasmids with the inserted cDNA in the opposite 
orientation showed bands of 250 and 350 bp. 
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to those from the 550 bp hGH cDNA 
clone {data not shown) and to the di- 
gested uncloned cDNA (Fig. 2). This 
suggested that this clone did contain 
cjDNA complementary to full-length or 
nearly fuU-length hGH mRNA. (This 
clone is designated chGH800/pBR322.) 

Sequence analysis of cloned DNA. 
The nucleotide sequence of the cloned 
DNA was determined by the chemical 
cleavage method of Maxam and Gilbert 
(//) and the chain-tennination technique 
of Sanger, Nicklen, and Coulson (12). 
The hGH mRNA sequence and the cor- 
responding amino acid sequence of hu- 
man pregrowth hormone can be derived 
from the DNA sequence (Fig. 3). The 
amino acid sequence determined from 
the DNA sequence is consistent with the 
known amino acid sequence of hGH (13) 
with the following exceptions: the DNA 
sequence predicts glutamine, aspara- 
gine, glutamine, glutamic acid, gluta- 
mine, aspartic acid, asparagine, and 
glutamine at amino acid positions 29, 47, 
49, 74, 91, 107, 109, and 122, respective- 
ly, while the protein sequence indicates 
glutamic acid, aspartic acid, glutamic 
acid, glutamine, glutamic acid, aspara- 
gine, aspartic acid, and glutamic acid. It 
is likely that the DNA sequence is cor- 
rect in this regard since it is sometimes 
difiicult in protein sequence analysis to 
differentiate aspartic acid from aspara- 
gine and glutamic acid from glutamine. 
The amino acid sequence of the signal 
peptide portion of human pregrowth hor- 
mone had not been previously deter- 
mined and is deduced from the mRNA 
sequence. If translation begins with the 
methionine codon "in phase," 26 codons 
proximal to the first amino acid of 
growth hormone (Fig. 3), then the prima- 
ry translation product of hGH mRNA 
would be a protein of 24,851 daltons, a 
value in agreement with the cell-free 
translation data shown in Fig. 1. 

A comparison of the amino acid and 
nucleic acid sequence homologies be- 
tween rat growth hormone, hGH, and 
hCS and their respective mRNA's is 
shown in Table 1. In the coding regions, 
there is higher homology between the 
nucleic acid sequences than between the 
amino acid sequences. This difference is 
consistent with the already mentioned 
view that the genes of these related hor- 
mones evolved from a common evolu- 
tionary precursor gene, and is further 
supported by the marked homology in 
the 5'-noncoding portions of the 
mRNA's for rat and human growth hor- 
mone. (Data for the 5'-noncoding region 
of hCS are not yet available.) Human 
growth hormone has more homology 
with hCS than with rat growth hormone, 
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Table 1. Amino acid and nucleic acid se- 
quence homology of growth hormone, cho- 
rionic somatomammotropin, and their mRNA's. 
Data for rat growth hormone (rGH) and 
human chorionic somatomammotropin (hCS) 
and their mRNA's are from (3. 4). For hCS, 
only data for amino acids residues 24 to 
191 (and the corresponding portion of the 
mRNA) and a portion of the noncoding 3'-re- 
gion corresponding to the cioned 550 bp frag- 
ment are used for comparisons, since data for 
the other portions are not available. The 
amino acid sequence of the prepeptide portion 
of hCS was determined by Sherwood et al. 
Q7). 



Homology 
(percent) 



Source 



hGH 
versus 
rGH 



hGH 
versus 
hCS 



Nucleic acid: 
5'-Noncoding 
Presequence 
Coding 
3'-Noncoding 

Amino acid; 
Presequence 
Coding 



73 
76 
76 

38* 

58 
67 



92 
94 

84 
86 



*The homology in this region can be increased to 55 
percent by adding appropriate gaps in the se- 
quences. This procedure also reveals that there is a 
homology of 27 out of 30 bases in the region of the 
AAUAAA {28). Similar conservation between spe- 
cies but with different sequences are found when hu- 
man and rabbit ^-globio (29) and human and rat in- 
sulin (JO) are compared. 



especially for the 3'-noncoding portions. 
This finding supports the hypothesis de- 
veloped earlier that the chorionic so- 
matomammotropin and growth hormone 
genes probably evolved by a gene dupli- 
cation mechanism (/) at some time after 
the separation of the human and rat spe- 
cies. In addition, the fact that both hor- 
mones exist in both species implies that 
the same hormones may have evolved 
independently more than once. 

Figure 4 shows the codons used for 
hGH mRNA. As is the case with rat 
growth hormone (3) and hCS (6) 
mRNA's, there is a nonrandom selection 
of codons. This appears to be mostly due 
to the preference for G (guanine) or C 
(cytosine) over A (adenine) or U (uracil) 
for the third position of the triplet codon. 
This is also the case with most (i, 14) but 
not all (14) eukaryotic mRNA's whose 
structures are known. 

Construction of a plasmid for growth 
hormone expression. To see whether 
hGH gene sequences can be expressed in 
bacteria, we used the plasmid p//-pED5- 
1 (7), which contains the regulatory re- 
gion l(po)La], the first gene (trpE), and 
15 percent of the second gene (trpXi) of 
the E. coll trp operon. Cells containing 
pfrpEDS-l normally synthesize small 
amounts of trp gene products. However, 
if trp operon transcription is derepressed 



by addition of 3/3-indolylacrylic acid, 
synthesis oitrp gene products increases, 
so that within 3 hours trp proteins ac- 
count for about 30 percent of the total 
cellular proteins (7). We hoped that by 
placing the hGH gene sequence under 
control of the trp operon, not only would 
it be expressed, but a higher level of 
hGH production could be obtained than 
was previously achieved with rat growth 
hormone gene sequences under control 
of the j3-lactamase gene (15). 

The hGH sequences from chGHSOO 
were inserted at the Hind III site of the 
frpD gene sequence as described in the 
legend to Fig. 5. In order to insert the 
hGH codons in phase with those of the 
trpD sequence, the Hind III site in the 
trpTi gene was manipulated in such a 
way as to shift the reading frame of any 
DNA inserted through the Hind III site 
of the plasmid by one base. To do this, 
p/rpED5-l was cleaved with Hind III, 
the protruding 5'-ends "filled in" with 
the use of DNA polymerase I (Kienow 
fragment) and synthetic DNA decamers 
containing the Hind III site ligated to the 
blunt-ended material. This DNA was 
then digested with Hind III to produce 
new cohesive Hind HI ends, and plasmid 
molecules were recircularized with the 
use of DNA ligase after the residual 
Hind III linker molecules were removed 
on a Sephadex (G-200) column. This ma- 
terial was used to transform E. coli and, 
after selection for ampiciUin-resistant 
colonies, the plasmid DNA was isolated 
from one of the transformants. The DNA 
sequence at the Hind III site of the new- 
ly constructed plasmid (designated 
pfrpED50) was determined and showed 
that the enzymatic reactions had ahered 
the reading frame as predicted. In 
this way, when chGH800/pBR322 was 
cleaved with Hind III and the hGH se- 
quences ligated to the Hind III site of the 
newly constructed plasmid, the codoris 
of hGH would be in phase with those of 
the trpXi gene, provided that the hGH 
gene sequences were inserted in the 
proper orientation. This was achieved by 
obtaining several clones, isolating plas- 
mid DNA, and determining the orienta- 
tion of the cloned segment by restriction 
endonuclease analysis (Fig. 5, legend). 

As is indicated in Fig. 5, the construc- 
tion of the hGH "expression" plasmid 
was such that the anticipated product 
would be a fusion protein containing the 
NHa-terminal region of the trpT> protein, 
amino acids coded by the 5 '-untranslated 
portion of hGH mRNA, the 26 amino 
acids of the signal peptide, and all of the 
amino acids of hGH. 

Synthesis of growth hormone in bac- 
teria. To determine whether the newly 
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constructed gene can direct the synthesis 
of large amounts of a new fused poly- 
peptide, and whether its expression is 
regulated by the trp promoter, cells con- 
taining the expression plasmid were 
derepressed for trp transcription, and 
proteins were labeled for 5 minutes with 
"C-labe!ed amino acids. Figure 6 shows 
an autoradiogram of sodium dodecyl sul- 
fate-polyacrylatnide gei of such proteins 
labeled at various times from 0 to 4 hours 
after induction of the trp operon. Two 
proteins (53,000 and 32,000 daltons) 
seem to be specifically derepressed by 
the inducer. The higher molecular weight 
protem is the trpE gene product (7). The 
32,000-dalton protein has approximately 
the anticipated size for the trpD-hGH fu- 
sion protein (34,000 daltons). It ts immu- 
noprecipitated by antiserum to hGH 
(Fig. 6, lane a) but not by the control 
antiserum (Fig. 6, lane b), and precipi- 
tation can be blocked by a large excess 
of hGH (Fig. 6, lanes c to h). Some of the 
trpE protein is immunoprecipitated by 
antiserum to hGH, but the amount is less 
when an excess of competitor hGH is 
added or control antiserum is used. This 



0-51234 a b 




27- 



24- 



result might be expected for two rea- 
sons. Precipitation of trpE protein by 
control antiserum may be due to the high 
abundance of this protein. More inter- 
estingly, specific precipitation of frpE by 
hGH antiserum (Fig. 6, lane c) and 
blockage of precipitation by an excess of 
competitor hGH (Fig. 6, lane e) may be 
the result of association of the trpE pro- 
tein and the /rpD-hGH fiised poly- 
peptide. The trpE and trpD proteins are 
normally associated in E. coli as a tetra- 
mer containing two subunits of each pro- 
tein {16); the resulting enzymatic activity 
(anthranilate synthetase) requires the 
trpE protein and the NHa-terminal 30 
percent of the trpD protein (,17). Thus, 
the fused //-pD-hGH protein may contain 
those trpT> residues required for binding 
trpE. All of these lines of evidence sug- 
gest that the 32,000-dalton protein is a 
fused frpD-hGH polypeptide. 

On the basis of the relative quantity of 
radioactivity incorporated into the trpD- 
hGH gene product, the fusion product 
appears to be a major protein made by 
the bacteria, constituting 3 percent of the 
total bacterial protein synthesis. Thus, 



c d 6 f g h 




the natural hGH gene sequence can be 
expressed at a high level in bacteria. 

The trpE and trpD protein molecules 
ordinarily accumulate at a similar rate 
(molar ratio 1:1) when the trp operon is 
induced (7). liowever, the molar ratio of 
trpE to (rpD-hGH is about 6:1, in- 
dicating that synthesis of f/pD-hGH is 
only 17 percent of the expected level. 
The reduced level of synthesis is not due 
to instability of the fused polypeptide 
since a "pulse-chase" experiment has 
shown that no significant degradation of 
the fused polypeptide occurs during the 
60-minute period after incorporation of 
the label (data not shown). There is some 
evidence that the chick ovalbumin pro- 
tein is also synthesized in£. coli at lower 
levels than expected (/S). 

Hypopituitary dwarfism . is a fairly 
common disease treatable only by re- 
placement with hGH (19). Growth hor- 
mone may also be useful in the treatment 
of other disorders. However, the poten- 
tial uses of this hormone have not been 
adequately investigated because its only 
source is pituitaries from human cadav- 
ers. In order to have an adequate supply 
of this hormone, it is necessary to find 
alternative means of producing it; the 
synthesis of hGH in bacteria may pro- 
vide such a means. 

Joseph A. Martial 
Robert A. Hallewell 
John D. Baxter . 
Howard M. Goodman 
Howard Hughes Medical Institute 
Laboratories, Department of Medicine, 
Metabolic Research Unit and 
Department of Biochemistry and 
Biophysics, University of California, 
San Francisco 94143 
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Changed Lyotropic Liquid Crystalline Structure Due 
to Polymerization of tlie Amphiphilic Component 

Abstract. Optical patterns in polarized light and x-ray reflections in the low-angle 
region were used to detect a shift from one liquid crystalline structure to another 
during polymerization. The polymerization took place in a lyotropic liquid crystal of 
water and sodium undecenoate, with a structure consisting of cylinders in a two- 
dimensional hexagonal close packing. After polymerization, a lamellar liquid crys- 
talline structure was obtained. 



The comprehensive knowledge of liq- 
uid crystalline structures (/^) that has 
been obtained through their use for dis- 
play systems and the excellent proper- 
ties of Kevlar fibers drawn from liquid 
crystals (5) have resulted in a recent fo- 



cus on polytnerization in systems of liq- 
uid crystalline character. The contri- 
butions so far have mainly been in the 
area of thermotropic liquid crystals (6); 
the equilibrium problems being more 
complex in lyotropic structures, which 



by definition are multicomponent systems. 

Early attempts to polymerize in lyo- 
tropic liquid crystals (J-IO) have re- 
cently (//, 12) been criticized as achiev- 
ing retention of the long-range order by 
freezing the structure with cross-cou- 
pling agents rather than forming an equi- 
librium Hquid crystal. The conditions for 
polymerization with retained liquid crys- 
talline structure have been stated (J/) by 
comparison with homotropic polymer- 
ization in crystalline structures {12-15). 
For a liquid crystalline structure the 
translational entropy component in the 
expression for the total free energy 
should also be included. Its magnitude is 
difficult to estimate; for microemuisions 
{16) the contribution of the entropic free 
energy is similar in magnitude to the con- 
tributions of other components of the 
free energy. 

The fact that polymerization with re- 
tention of the liquid crystalline structure 
may be a rare phenomenon in lyotropic 
liquid crystals (7/) encouraged us to 
choose an alternative route: to observe 
structural changes during polyinerization 
in a lyotropic liquid crystalline matrix. 
We are now able to report a change from 
one liquid crystalline structure to anoth- 
er during polymerization of the amphi- 
philic component. To our knowledge, 
this is the first report of such a change. 
It is essential to realize that the struc- 
ture obtained represents the stable 
conformation of the system; there is 
no "freezing in" of a structure— it 
forms spontaneously from an isotropic 
meh. 

The components of the liquid crystal 
were distilled water (with Q.05M ammo- 
nium persuifate as an initiator) and so- 
dium undecenoate. They formed a liquid 
crystal in the concentration range 47 to 
59 percent sodium undecenoate (by 




Fig. !. (A) optical pattern of the liquid crystalline phase before polymerization, typical of a structure of hexagonally close-packed cylinders. (B) 
Optical pattern of the polymerized sample. 
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Direct expression in Escherichia coli 

of a DN A sequence 
coding for human growth hormone 

Oavid V. Goedde.* Herbert 

KeHcM Itakurat, Darnel G. Yansura , micnae 

Roberto Creat & Peter «• ^,.^,s,.b„^ 

gaiaaS^-^hicken ovalbumin^ ') in Escherichia cob which 
S 1 SeTain antigenic activity, but which cannot be easily proces- 
Sd give the'natural gene P-d-t. Howev^^^^^^^ 
mouse dihydrofolate reductase cDNA into G-taiiea, i-sa- 
Seaved PBR322 resulted in the expression of a protein which 
Se?ed to be dihydrofolate reductase^ This protein conM 
ha?e been derived from the proteolytic cleavage of a 
Sniase^ihydrofolate reductase fused protein or from the 
dkecTinitiation of translation at the dihydro o ate reductase 
XtG (AUG start codon. Neither of these possibilities could be 
iied on for HGH expression. /« P'^^.^^ly^rST iSrw; 
fused bacteriaily produced protein «g» ^ JSoI 

unlikelv and initiation of translation at an HGH start coaon 
Touldb; expected to yield the pre-hormonecontaimngan extra 

26 amino adds. The ^temative approach of complete chemi^ 
synthesis of genes, although shown to be extremely effechve for 
the bacterial production of the human peptides somatostatin 
and^nsulin'^wouldbeverytimeconsumingforapolypeptideas 

large as HGH. 



DNA coding for human growth hormone was constructed 
by using chemically synthesised DNA in conjunction mth 

eZmLally prepared cDNA, J^'^'^'f ffl la 
expressed in Escherichia coli under the control of he lac 
prZoter. A polypeptide was produced having the size^d 
immunological properties characteristic of mature human 
growth hormone. . 

HUMAN GROWTH HORMONE (HGH) is a ^ofl^i^V"^ 
amino adds which is synthesised in the J^ff^^^^^^^ 
pituitary.Growth inhypopituitary dwarfs, whosesnids^tur^^^^ 
LtoadeficiencyofHGH,canberestoreddur,ngchadh^^^^^ 

administration of this hormone'. In addi ion, HGH inay prove 
S?iiveinthetreatmentofavarietyof^bnents,includingbone 

fraSres skin burns and bleeding ulcers^ As growth hormone is 
spS speSc, human cadavers have been the only source of 

^?he primary translationproduct of growthhormone mRNAis 
aorecursor protein consisting of a signal peptide attached to the 
N nul of growth hormone^ Such signal or 'pre' sequences 
are cSSeristic of secreted proteins. In the case of rat growth 
hormone SgH), DNA sequencing of cDNA P^^P/^d &orn 
RGH mRNA has identified the 26 ammo a«='d residues o^^^^ 
pre-sequence^ From information derived from the cUNA 
seauence of human growth hormone' we have designed and 
consS^Sed a bacterid plasmid which instructs the synthesis of 
substantial quantities of mature HGH m a microbial cell. 



HGH gene assembly strategy 

The general approach used for the bacterial synthesis of HGH 
iTvolves a combination of the cloning of the complementary 
SnA (CDNA) prepared from pituitary m^A with the cloning 
of chemically synthesised DNA. The ^P«f «;f ^^^^^S^^' 
on the known restriction endonuclease pattern of HGH cDN A ^ 
Haelll sites are present in the 3' nona>^ng regon and^^^^^^^^^ 
sequence coding for amino acids 23 and 24 of HGH. Treatment 
of double stranded (ds) HGH cDNA with Haelll fves a DNA 
fracment of 551 base pairs which includes codmg sequences for 
So adds 24-191 o'f HGH. We planned to clone this cDNA 
fragment and to clone separately a chemically synthesised DNA 
'aSor' fragment containing an ATG initiation codon and 
coding sequences for residues 1-23 of HGH. These mo DNA 
fralente would be combined to form a synthetic-natural 
'hybrid' gene. When inserted into a plasmid downstream from a 
suitable Lcterial promoter and ribosomebmdings^^^^ 
could be expected to direct the synthesis of fMet-HGH. The ac 
that most bacterial proteins do not co^^m N-terminal 
methionine residues suggests that the fMet should be efficiently 
removed resulting in the direct expression of 

Sere are several advantages of this approach over con- 
ventional expression methods which utihse eirtier chemically 
syXSed DNA or cDNA exclusively. The cDN/BeK^!Pi»«&ig 



Construction and cloning of plasmids 
containing HGH DNA sequences 

SSssr«dM^ 

S in S cSnA. The 12 indicated deoxyoli^n^^^^^^^^^ 
(Fig. la) were synthesised by the improved P^o Photnes^^^ 
method essentially as described previously for insulin . Figure 
Tfc ummStLteps 

these DNA fragmentsinpBR322 (ref. 15). Many ransformants 
of S S 294 (^f . 16) were obtained «ith plasmids contamin 
EcoRI-M«dIII inserts of approximately the desired sn«. Tlie 
nucleotide sequences of three of these inserts ^^re d»ned 
One oHGH3, had the expected DNA sequence. The other two 
?ad a deletion of the A^ base pak at the third nucleotide 
Dosition for the codon of amino acid 5 (Pro). ■ ■ .1,^ 

^ ^ construction of the plasmid (pHGH31) contaim"S 
H^elll cDNA fragment of HGH is outhned m Fig- 2 . To«a 
Bolv(A)-mRNA from human pituitaries was used in the 
?y^£Sis of ds-cDNA by RNA-dependent DNA po'^^^^^^^^^^ 
(Lerse transcriptase) and DNA polymery J He^^^^^^ 
ment. The cDNA was treated with Haelll restricflon 
Endonuclease and DNA of about 5 50 base pairs P"nfied by 
gel electrophoresis. This DNA fraction was h^n tailed with 
deoxyC residues using terminal deoxynudeotidyl transferase 
and cloned in pBR322 DNA which had been cleaved wi^ Psd 
Ld extended with deoxyOresidues. This procedure resto^st^^^ 
Haelll restriction sites on the cDNA and regenerates PstI sites 
ft both ends of the insert. Approximately 200 ^an^f ™^ rf 
E coli ^1776 (ref. 20) were obtained from Ing of C-tailea 
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ri£. 1 Design and molecular cloning ot chemicaily syothesised DNA 
coding for the first 24 amino acids of human growth hormone, a. The 
synthetic plan. The 5' ends have single-stranded cohesive termini for the 
EcdBl and Mndlll restriction endonudeases to facilitate plasmid con- 
struction. The methionine codon at the left end provides a site for initiation 
ot translation. Twelve different oligonucleotides, varying in size from 
undecamer to heiadecamer, were synthesised by the improved phoso- 
photriester method". These oligonucleotides, Ui to tJ* and Li to Lj, are 
indicated by arrows, b. Assembly of chemicalij synlhesised DNA fragments 
and construction of plasmid pHGH3 containing a 84-base pair insert codmg 
for HGH amino acids 1-24. AUquots of 10 ng of Uj-Us »nd Lj-L^ were 
phosphorylated using T^ polynucleotide kinase and [v-"PlATP (NEN) by a 
procedure of D.V.G.'^ Three separate T* ligase-catalysed joining reactions 
were performed; 10 ng ot 5'-OH fragment was combined with the 
phosphorylated U^, Lj and U; phosphorylated U,. U^, U and U were 
combined; and 10 (ig of 5'-OH fragment L, was combined with the ph<B- 
phorylated L,, U, and Uj. These ligations were carried out at 4 °C for 6 h in 
300 iJ of 20 mM Tris-HCl (pH 7.5). 10 mM Mgaj, 10 mM dithiothreitol, 
0.5 mM ATP using 20 units of Ugase. The three ligation mixtures were 
then combined, 20 units of T^ Ugase added, and the reaction aUowed to 
proceed tor 12 h at 20 °C. The mixture was ethatiol-preeipitated and elec- 
trophoresed on a 10% polyacrylamide gel. The band migrating at 84 base 
pairs (bp) was sliced from the gel and eluted'*. Plasmid pBR322 (ref. 15) 
(1 (ig) was treated with EcoRI and ifimflll, the large fragment isolated by gel 
electrophoresis and ligated to the syntheticDNA'^ This mixture was used to 
transform E. coli 294 (ref. 16). The EcoRI-Zfindlll insert of pHGH3^ had 
the correct sequence as determined by the Maxam-Gilbert method . 



sequences by the Grunstein-Hogness hybridisation procedure 
using "P-labelled human chorionic somatomammotropin 
cDNA^^ as probe. Seven colonies hybridised specifically with 
the probe, which is nearly identical to HGH in nucleotide 
sequence'. Three had identical Jfaelll inserts of approximately 
550 base pairs containing Smal, BgUl, PsO. and PvuH sites, all 
characteristic of HGH cDNA*. Sequence analysis of the insert 
from one of these clones, pHGH31 (Fig. 3) gave only one 
discrepancy (CUG versus CUA codon for leucine 101) when 
compared with the previously determined cDNA sequence . 
The amino acid sequence derived from the cDNA sequence 
differs from the published amino acid sequences of Li at 
positions 74 (Glu versus Gin), 107 (Asp versus Asn) and 109 
(Asn versus Asp). Although we cannot rule out post-tran- 
scriptional modification, the discrepancies probably stem from 
shortcomings of earlier protein sequencing methods in the 
identification of amidated ammo acids. 

Construction and cloning of HGH 
expression plasmids 

To express directly HGH in E. coli, a plasmid (pGH6) having 
two lac promoters was constructed as follows. A 285 -base pair 
EcoRI fragment containing two 95-base pair UV-5 lac 
promoter fragments separated by a 95-base pair heterologous 
DNA fragment was isolated from the plasmid pKB268 (ref. 24) 
and inserted into pBR322 at the EcoRi site. A plasmid (pGHl) 
was isolated with the promoter orientated to initiate tran- 
scription in the direction of the tetracycline resistance (tc^) 
gene. The EcoRl site distal to the tc^ gene was destroyed by a 
previously outlined procedure". 

Figure 4 reviews the steps involved in assembling the two 
separately clonedHGHDNAfragments. Treatment ofpI^HS^^Jntheg^ur^^^^^^ 



with BcoRI and HaeSil gives a 77-base pair fragment with an 
EcdRl cohesive end and a HccIII blunt end (also see Fig. la). 
This fragment, which codes for HGH amino adds 1-23, was 
isolated from a polyacrylamide gel. The plasmid pHGH31 was 
cleaved with Haelll and the 55-base pair HGH cDNA fragment 
isolated by gel electrophoresis. Xmal treatment of this fragment 
yielded a 512-base pair fragment coding for amino acids 24-191 
of HGH with one Haelll blunt end and one Xmal sticky end. 
The ligation of these two fragments resulted in the formation of 
high molecular weight DNA. Treatment with Smal (which 
recognises the same sequence as Xmal but leave flush ends) and 
EcoRI converted the ligation products to three distinct frag- 
ments: a 154-base pair dimer of the 77-base pair DNA frag- 
ment, a 1,024-base pair dimer of the 512-base pair fragment, 
and the desired 591 -base pair DNA coding for the entire HGH 
sequence. This latter fragment was purified by gel electro- 
phoresis and inserted into pGH6 between the EcoRl and the Si 
nuclease-treated Hindlll sites. Following transformation of 
Arl776, the expression plasmid pHGH107 was identified by 
colony screening and restriction analysis as described in Fig. 3. 
The HGH DNA sequence of pHGH107 was verified by the 
Maxam-Gilbert procedure". 

In our plan to express HGH two base pairs were included 
between the EcdRl AATT 'sticky' end and the ATG codon of 
the chemically syntiiesised portion of the HGH coding sequence 
(see Fig. 1). Therefore, in the plasmid pHGH107, 11 base paurs 
separate the lac AGGA ribosome binding site^* and the ATG 
translational start for HGH: 



5'. 
3', 



. AGGA A AC AG[A ATT C T AT G . . . 
.TCCTTTGTCTT AAlGAT AC . .. 
■urally occurring lac system the separation between 
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ribosome binding site and initiation codon is 7 base paiis. The 
exact lac sequence in this region was restored as foUows. The 
JSmid was'cleaved with Eco^, treated -Jh S nuckase o 
digest the single stranded ends, ^^^^ ':ectosed by blu«-end^^^^ 
ligation. The plasmids from three individual transformants of 
" 1776 were analysed. In ail instances the EcoRM^ibetn 
removed and a new A/hI site created in the plasmid (pHGHlOT- 

1). 

5'...AGGAAACAGjCTATG... 

3'...TCCTTTGTC1GAT AC... 

This structure, in which the ATG codon is now seven nucleo- 
tides downstream from the ribosome binding site^ might be 
expected to increase the efficiency with which HGH mRNA is 
translated. 

HGH radioimmune activity in 
bacterial extracts 

Extracts of E. coU a:1776 containing the expression plasmids 
were tested for HGH expression by a direct radioimmunoassay 
fRIA) (Table 1). Easily detectable amounts of HGH were 
produced by bacteria containing pHGH 107 or the derivative of 
PHGH107 lacking the EcoRl site (pHGH107-l). Contrary to 
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F* 2 Construction of a plasmid (pHGH31) S^^^^J 
fo* amino adds 24-191 of ^^^^^^'^^"^t^^&^^p, 
oreaared from human pituitaries by a pubbshed procedure . ds-cuNA 
a 5 ari was prepared from S of this RNA essentiaUy as descnbed by 
wSri- except that DNA polymerase 'Kleno* 1™ 
«^tettoted for DNA polymerase I in the second strand synthesis, cDNA 
sSwi «ated witKlII, elertrophoresed on an 8% PolJ^^f^ ^'^e 
«1 " nd the region around 550 base pairs eluted. Approamsrtdy 1 ng of 
S^NAwIrobtred-Terminaldeoxynudeotidyltransfer^^^ 
to add aroroximateiy 20 dC residues per 3' termiaus'". Annealing of the 
dclueTdSA^ththedG.taaedvec.c.DN^ 
in 130 id of lOmMTris-HCI (pH7.5) ^aQ. 0 25 mM EDTA^ 

The mixTure was heated to70'C, allowed to cool slowy^^^^^ 
to 20 "C (6 h), before being used to transform x (ref • 20) by » publtshed 
procedure . 



s' 



40 

" n vy.. Clu Glu Ala Tyr He Pro ty. Glu Gla Lys Tyr Ser fhe Leu Gin Asn Pro Gin Thr Ser 

:r. s z z z »: »: » » - - - - «-« ^ - - 



30 



- - - ::i - - z z z z z z z z z z z z z z z z z z z z z z 

100 

z z z z z z z z z z z z z z z z z z z z z z z z z z z z z z 

140 

120 

... ... .y. A. .e. .1. Glu Oly lie Gin XH. .eu «eC .ly A. A. Oly Se. P„ Ars Xh. Gly Gin Ue P.e .y. Ola 

GAC C.C CUA AAG OAC C.A GAG CAA GGC AUC CAA ACC C.C AUG GGG AGG CUG GAA GAU GGC A=C CCC COG AC. GCG C^.C AAG CAG 

z - - = - r - ^ ii: E 2 = ^ r ^ 

180 

Z Z Z Z Z Z Z Z Z Z 7^ z z z z zzz z z r c«...«=».».==»=""=. 

Pvu II IM. ^ 

Xma I 

Gccucnccu Gcce ....3' 

Board Assigned Page #1851 



JAN 29 2007 15:49 FR CISTI ICIST 



613 952 9303 TO 12122308888 



P. 06/07 
547 I 



Mature 



Vol. 281 18 October 1979 



plasmids 
inoassay 
rH were 
vative of 
itrary to 



TVir Ser 
ACC UCC 

30 
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cue CDG 



Val Tyr 
GDC UAU 
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R.. 4 Construction of a 
plasmid (pHOHlOT) lot the 
bacterial expression of human 
growth hormone. pHGHi 
(10-p.g) was deaved with 
EcoRl and Haelll restriction 
endonucleases and the 77- 
base pair fragment contunm| 
coding sequences for HGH 
amino adds 1-23 was isolated 

from an 8% P^Jy^^Ti'Sitj 
eel. The plasmid pHOHJl 

(-5 as) was <=l«*''*^ "'^ 
Hae^. The 551-base pa.r 
HGH sequence and a co- 
Sigrating 540-ba« P- 
Haem fragment of pBi«« 
were purified by gel elecKO" 
phoresis. Subsequent treat- 
ment with Xrrwl cleaved only 
the HGH sequence, temovmg 
39 base pairs from the i 
noncoding region. The resul- 
ting 512-base pair fragment 
was purified from the 540- 
base pair pBR322 HaAH 
piece by electrophoresis on a 
6% polyacrylamide gel. 
Samples of 0.3 of the 77- 
base pair EcdBl-HaAU 
fragment and 0.3 jig of the 
512-base pair HaelU-Xmdl 
fragment were polymerised 
with T* DNA ligase m a l6-ii! 
reaction for 14 hat 4 °C. The 
mixture was heated at 70 C 
for 5min to inactivate the 
ligase, then treated with 
EcaRl (to cleave fragments 
which had dimerised thMUgh 
their EcoSl sites) and with 
Smal (to cleave Xmal 
dimers), yielding a 591-baM 
pair fragment with an £coKi 
'sticky' end and a Smal 'blunt' 
end. After purification on a 
6% polyacrylamide gel, 
approximately 30 ng of this 
fragment were obtained. The 
expression plasmid pGH6 
containing tandem lac UV-5 
promoters, ww treated suc- 
cessively with HjndUl, 
nuclease Si, and EcdRi and 



correctly assembled gene when eleaveawKnnuc , 



the plasmid pHQHlu ' w^^"*"*'" ,. 2 HBlOl which 
strain D1210 an ^^"^^^t^^ ^^.^^^j.H fjemonstrate 
overproduces ^f'^'^ZocS^mS^odu^^^^ and that this 
iSSTre" SrSion 5 the inducer isopropyi- 
0 -D-thiogalactoside . 

Ideodflcatioii of HGH Itom bacterid extracts 

HGH, Ll extract of a 1-litre ^^^^^J^'^'^^^ffJ^^. ?Sc 
prepared and fractionated as descnbed in l^ig. = icg „ 
polyacrylamide gel electrophoresis pattern of tigs^|^^^|. 



— I Tj^u ;„ ritTsOi from bacteria containing 
Table 1 Radioimmunoassay of HGH """^ 
expression p'asmtds 



Strain 
xl776/pHGH107 

;fl776/pHGH107-l 

;fl776/pBR322 
D1210/pHGHlO7 
D1210/pHGHlO7(IPTG) 



Cell density 
(cells per ml) 

3.69x10* 
1x10' 
3.6x10' 
ixlO' 
3.6X10' 
3.8x10' 
3.8x10' 



2.4 
1.4 
1.5 
0.5 
0 

2x10" 
1.0 



HGH copies 
per cell 

lg6,000 
39,000 
116,000 
14,000 
0 
15 

75,000 



E. ..n trains cont^ning the a^ro^^e P^ ™^ 

cell density and ""^"^f f »^Sr;^*'^s^end^ in 55 ,U of 50mM 
contain 10' cefls per -"l; JJ?* f "P^** lysoiyme, 0.025% 
Tris-HCl (pH8). SO^M^i^J^^J,** '"TT'C IM of ISOmMTris-HCl 
lithium dodecylsulphate. After 30 mm at 0 J l^K^ , ^^^^ The 
(PH7.5). 280mMMga„ 4?^^=^^^ *°°,\'^fi^drtionsofthesup«natants 
mixture was centrituged for 1 5 mm at ^^.-^^^^yS^bT^^ PMST kit (Phar- 
were analysed by 'adioimmunoaMay i« ng to^^^ ^ j ^ 

nauy concentration of 2 mM. 
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contains partiaUy punfied HGH ^/f™,3e slab gel using the buffer 
samples were sep^^ted on a ^^t^^^^'^^^Z^rT Side l^^tes were 

prepared by growing ceils in LB with * 1^8 J oredpitated with 10 

in 2% SDS, ,^,rJeU?ts weS^^^^ SDS sample 

volumes of cold acetone and the peuetswere icuu.." 
bufl^ or use in gei electrophoresis. The P^f^^^^f^ ^^^^ 
prepared fromas.a«phasecul^^^^ 

collected res^pend^d m ^^^^^^^,''^,€1 and lysed by soni- 
potassium phosphate ^^l^l^''^^^^^) added lo 0.2%. After 

::?rar;to% s—. -tnAlTo'stNra nd 
Sa;:^e^^^|=^«H«S 

mediuB.^»conta.omg0.2.DC.ml SO.,cnaseo Following 
urea and SDS as described above. 



purified sample is shown in Fig. 5a, slot 4. The most comment 
Cdco-migrateswithHGHstandardanditsrelaUveintena^is 

SsTenTwith the 25% purity calculated from rad.omimune 

'1^ independent attempt to confirm that the antigenicdly 
actite component in the bacterial extract, co-migrates with 

Received 6 July; jcoepted 16 August 1979. 
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HHH in SDS eels we precipitated extracts from cells labelled 
whh -S wUh S seTum prepared against HGH and separated 
SoredSedproteinsbypolyacrylamidegelelectrophor^^^^ 
m auSogram of the immunoprecipitated "^atenal (Fig. 

,lot TSws a major band, with electrophoretic mobihty 
5b, slot -1) shows a mdjoi u , gmaUer bands. All five 

identical to that of HGH, as weu as rour '^(i.iaheUeH 
of these bands are present in the unfractionated 5>-la.belled 
WsIte felotT) whereas only the band with the same mobility ^ 
SgH t^n be Sn in unlabeUed extracts o^.^ell^.^^l^e 
broth (Fig 5a, slot 2). Allfive immune preapitated bands can be 
rmpefedou?byexcesscoldHGH(datanots^^^^^ 
aSenically related. This suggests that the four smaller HGH- 
S^prS;^^^ observed in minima! media may be due to 

'™A55S£nff fvldSof proteolytic degradation of HGH is 
or^enedtoTablel.ThelevelsofHGHin extracts from^^^^^ 
SntaSing PHGHIO? or pHGH107-l were considerably 
SS S log phase cultures than in stationary-phase cultures. 
sJatas Sansf onned with several other plasmids whose products 
aS inderTc promoter control show no comparable decrease of 
Sne Sreslion as a function of growth phase (unpublished 
fesultsK his laboratory). It thus seems that there is some 
uraover o?HGH in bacterial ceUs although we «innot rule om 
Sr exiJanations for the four smaUer HGH-related products 
S the ^Tlabelled extracts such as premature termmation of 

" DeSte the observed instability of HGH in certain growU. 

0^;^: SUIT'S 

genes using the same promoter in optimised conditions . 

Conclusion . .,^vta a 

Using a novel combination of chemicaUy synthesised DNA and 
rnNA a recombinant E. coli strain has been constructed which 
?oduc;es St large amounts. This is the first ^e tha a 
hmnan oolvpeptide has been directly expressed m E. coli m a 
nr-precS form. The hybrid DNA cloning techniqug 
described as a route to the cloning and expression of HGH 
ceding sequences in E. coli are generally applicable to other 
TofetWes which are synthesised initially '"^^^^-^P^^^ 
cureors and later processed, or for which full length cDNA 
transcripts are unavailable. . . • ™ni<»rt 
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[57] ABSTRACT 

A cDNA library is constructed using mRNA from human 
fibroblasts induced with poly(I):poly(C). A bacterial clone 
containing fibroblast interferon cDNA sequences identified 
by hybridization to a cDNA probe synthesized using 
deoxyoligonucleotide primers which hybridize to fibroblast 
interferon mRNA specifically. Expression plasmids are con- 
structed which permit the synthesis in E. coli of 8x10^ units 
of human fibroblast interferon per liter of culture. The 
bacterially produced fibroblast interferon is indistinguish- 
able from authentic human fibroblast interferon by several 
critieria. 
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MATURE HUMAN FIBROBLAST 
INTERFERON 



CROSS-REFERENCE TO RELATED 
APPLICATION 

This is a continuation of application Ser. No. 889,722, 
filed Jul. 28, 1986, now abandoned which is a divisional of 
application Ser. No. 291,892, now abandoned filed Aug. 11, 
1981, which is a continuation-in-part of application Ser. No. 
190,799, filed Sep. 25, 1980, now abandoned. 

FIELD OF THE INVENTION 

This invention relates to the microbial production, via 
recombinant DNA technology, of human fibroblast inter- 
feron for use in the treatment of viral and neoplastic dis- 
eases, and to the means and end products of such production. 

BACKGROUND OF THE INVENTION 

The publications and other materials referred to herein to 
illuminate the background of the invention and, in particular 
cases, to provide additional detail respecting its practice are 
incorporated herein by reference and, for convenience, are 
numerically referenced in the following text and respec- 
tively grouped in the appended bibliography. 
Recombinant DNA Technology 

With the advent of recombinant DNA technology, the 
controlled microbial production of an enormous variety of 
useful polypeptides has become possible. Already in hand 
are bacteria modified by this technology to permit the 
production of such polypeptide products such as somatosta- 
tin, the (component) A and B chains of human insulin, 
human growth hormone. More recently, recombinant DNA 
techniques have been used to occasion the bacterial produc- 
tion of proinsulin, thymosin alpha 1, (an immune potenti- 
ating substance produced by the thymus) and leukocyte 
interferon. 

The workhorse of recombinant DNA technology is the 
plasmid, a non-chromosomal loop of double-stranded DNA 
found in bacteria and other microbes, oftentimes in multiple 
copies per cell. Included in the information encoded in the 
plasmid DNA is that required to reproduce the plasmid in 
daughter cells (i.e., a "replicon") and ordinarily, one or more 
selection characteristics such as, in the case of bacteria, 
resistance to antibiotics which permit clones of the host cell 
containing the plasmid of interest to be recognized and 
preferentially grown in selective media. The utility of plas- 
mids lies in the fact that they can be specifically cleaved by 
one or another restriction endonuclease or "restriction 
enzyme", each of which recognizes a different site on the 
plasmidic DNA. Thereafter heterologous genes or gene 
fragments may be inserted into the plasmid by endwise 
joining at the cleavage site or at reconstructed ends adjacent 
to the cleavage site. DNA recombination is performed 
outside the cell, but the resulting "recombinant" plasmid can 
be introduced into it by a process known as transformation 
and large quantities of the heterologous gene-containing 
recombinant plasmid obtained by growing the transformant 
Moreover, where the gene is properly inserted with refer- 
ence to portions of the plasmid which govern the transcrip- 
tion and translation of the encoded DNA message, the 
resulting expression vehicle can be used to actually produce 
the polypeptide sequence for which the inserted gene codes, 
a process referred to as expression. 
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Expression is initiated in a region known as the promoter 
which is recognized by and bound by RNA polymerase. In 
some cases, as in the tryptophan or "trp" promoter preferred 
in the practice of the present invention, promoter regions are 

5 overlapped by "operator" regions to form a combined pro- 
moter-operator. Operators are DNA sequences which are 
recognized by so-called repressor proteins which serve to 
regulate the frequency of transcription initiation at a par- 
ticular promoter. The polymerase ttavels along the DNA, 

10 transcribing the information contained in the coding strand 
from its 5' to 3' end into messenger RNA which is in turn 
translated into a polypeptide having the amino acid sequence 
for which the DNA codes. Each amino acid is encoded by a 
nucleotide triplet or "codon" within what may for present 

15 purposes be referred to as the "structural gene", i.e. that part 
which encodes the amino acid sequence of the expressed 
product. After binding to the promoter, the RNA polymerase 
first transcribes nucleotides encoding a ribosome binding 
site, then a translation initiation or "start" signal (ordinarily 

20 ATG, which in the resulting messenger RNA becomes 
AUG), then the nucleotide codons within the structural gene 
itself. So-called stop codons are transcribed at the end of the 
structural gene whereafter the polymerase may form an 
additional sequence of messenger RNA which, because of 

25 the presence of the stop signal, will remain untranslated by 
the ribosomes. Ribosomes bind to the binding site provided 
on the messenger RNA, in bacteria ordinarily as the mRNA 
is being formed, and themselves produce the encoded 
polypeptide, beginning at the translation start signal and 

30 ending at the previously mentioned stop signal. The desired 
product is produced if the sequences encoding the ribosome 
binding site are positioned properly with respect to the AUG 
initiator codon and if all remaining codons follow the 
initiator codon in phase. The resulting product may be 

35 obtained by lysing the host cell and recovering the product 
by appropriate purification from other bacterial protein. 
Fibroblast Interferon 

Human fibroblast interferon (FIF) is an antiviral protein 
which also exhibits a wide range of other biological activi- 

40 ties (see ref. 1 for review). It has reportedly been purified to 
homogeneity as a single polypeptide of 19,000-20,000 
molecular weight having a specific activity of 2 to 10x10^ 
units/mg (2,3). The sequence of the 13 NH2-terminal amino 
acids of PIP has been determined (4). Houghton et al. (5) 

45 have used synthetic deoxyoligonucleotides (predicted from 
this amino acid sequence) to determine the sequence of the 
276 S'-terminal nucleotides of FIF mRNA. Taniguchi et al. 
(6) and Derynck et al. (7) have recenfly employed RNA 
selection procedures to identify cloned cDNA copies of FIF 

50 mRNA in E. coli. See also Taniguchi et al.. Gene 10, 11 
(1980) and Proc. Natl. Acad. Sci. (U.S.A.) 77, 5230 (1980) 
and Nature 285, 547 (1980). 

While isolation from donor fibroblasts has provided suf- 
ficient material for partial characterization and limited clini- 

55 cal studies with homogeneous fibroblast interferon, it is a 
totally inadequate source for the amounts of interferon 
heeded for large scale clinical trials and for broad scale 
prophylactic and/or therapeutic use thereafter. Indeed, pres- 
enfly clinical investigations employing human fibroblast- 

60 derived interferon in antitumor and antiviral testing have 
principally been confined to crude (<1 percent pure) prepa- 
rations of the material, and long lead times for the manu- 
facture of sufEcient quantities, even at unrealistic price 
levels, have critically delayed investigation on an expanded 

65 front. 

We perceived that application of recombinant DNA tech- 
nology would be the most effective way of providing large 



Board Assigned Page #1 863 



5,460, 

3 

quantities of fibroblast interferon which, despite the absence 
in material so produced of the glycosylation characteristic of 
human-derived material, could be employed clinically in the 
treatment of a wide range of viral and neoplastic diseases. 

More particularly, we proposed and have since succeeded 5 
in producing mature human fibroblast interferon microbi- 
ally, by constructing a gene therefor which could then be 
inserted in microbial expression vehicles and expressed 
under the control of microbial gene regulatory controls. 

Our approach to obtaining a fibroblast gene involved the 10 
following tasks: 

1. Partial amino acid sequences would be obtained by 
characterization of fibroblast interferon purified to 
essential homogeneity, and sets of synthetic DNA 
probes constructed whose codons would, in the aggre- 15 
gate, represent all the possible combinations capable of 
encoding the partial amino acid sequences. 

2. Bacterial colony banks would be prepared containing 
cDNA from induced messenger RNA. The probes of 
part (1) would be used to prime the synthesis of 
radio-labelled single stranded cDNA for use as hybrid- 
ization probes. The synthetic probes would hybridize 
with induced mRNA as template and be extended by 
reverse transcription to form induced, radio-labelled 
cDNA. Clones from the colony bank that hybridized to 
radio-labelled cDNA obtained in this manner would be 
investigated further to confirm the presence of a full- 
length interferon encoding gene. Any partial length 
putative gene fragment obtained would itself be used as 
a probe for the full-length gene. 

3. The full-length gene obtained above would be tailored, 
using synthetic DNA, to eliminate any leader sequence 
that might prevent microbial expression of the mature 
polypeptide and to permit appropriate positioning in an 
expression vehicle relative to start signals and the 
ribosome binding site of a microbial promoter. 
Expressed interferon would be purified to a point 
permitting confirmation of its character and determi- 
nation of its activity notwithstanding the absence of 
glycosylation. 

SUMMARY OF INVENTION 

A series of replicable plasmidic expression vehicles have 45 
been constructed which direct the high level synthesis in 
transformant microorganisms of a mature polypeptide with 
the properties of authentic human fibroblast interferon. The 
product polypeptide exhibits the amino acid sequence of 
such interferon and is active in in vitro testing despite the 50 
lack of glycosylation characteristic of the human-derived 
material. Reference herein to the expression of "mature 
fibroblast interferon," connotes the bacterial or other micro- 
bial production of an interferon molecule unaccompanied by 
associated glycosylation and the presequence that immedi- 55 
ately attends mRNA translation of the human fibroblast 
interferon genome. Mature fibroblast interferon, according 
to the present invention, is immediately expressed from a 
translation start signal (ATG) which also encodes the first 
amino acid codon of the natural product. The presence or 60 
absence of the methionine first amino acid in the microbially 
expressed product is governed by a kinetic phenomenon 
dependent on fermentation growth conditions and/or levels 
of expression in the transformant host. Mature fibroblast 
interferon could be expressed together with a conjugated 65 
protein other than the conventional leader, the conjugate 
being specifically cleavable in an intra- or extracellular 
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environment. See British Patent Publication No. 2007676A. 
Finally, the mature interferon could be produced in conjunc- 
tion with a microbial "signal" peptide which transports the 
conjugate to the cell wall, where the signal is processed 
away and the mature polypeptide secreted. 

DESCRIPTION OF THE FIGURES 

FIG. 1 depicts the protein sequence information used to 
design degenerate dodecanucleotide primers. Below the 
protein sequence are corresponding mRNA sequences and 
six pools of complementary deoxyoligonucleotide primers. 

HGS. 2A-C are autoradiographs of nitrocellulose-bound 
plasmid DNA prepared from some of the 600 bacterial 
transformants having DNA from the fibroblast cDNA 
library. The plasmid DNA of the nitrocellulose membrane 
was hybridized with either probe A, B, or C. 

FIG. 3 presents DNA sequence of the cDNA insert of 
clone pFIF3 as determined by the Maxam-Gilbert proce- 
dure. The deduced protein sequence in printed above the 
DNA sequence. 

FIG. 4 is a diagram showing the approach used to remove 
the signal peptide coding regions from pFIFS. 

FIG. 5 is an autoradiograph of the polyacrylamide gel 
used to isolate the 141bp fragment lacking the signal peptide 
coding regions. 

FIG. 6, and 6a, 6b, 6c schematically depicts the construc- 
tion of plasmids coding for the direct expression of mature 
fibroblast interferon. Restriction sites and residues are as 
shown ("Pst I", etc.). "Ap'^" and "Tc'*" connote portions of 
the plasmid (s) which express, respectively, ampicillin and 
tetracycline resistance. The legend "p o" is an abbreviation 
for "promoter operator." 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

A. Microorganisms Employed 

The work described involved used of the microorganism; 
E. coli K-12 strain 294 (end A, thi~, hsr~, hsm^.*), as 
described in British Patent Publication No. 2055382 A. This 
strain has been deposited on Oct. 28, 1978 with the Ameri- 
can Type culture Collection, which is located at 12301 
Parklawn Drive. Rockville, Md. 20852, and given ATCC 
Acession No. 31446. All recombinant DNA work was 
performed in compliance with applicable guidelines of the 
National Institutes of Health. 

The invention, in its most preferred embodiments, is 
described with reference to E. coli, including not only strain 
E. coli K-12 strain 294, defined above, but also other known 
E. coli strains such as E. coli B, E. coli x 1776 and E. coli 
W 3110, or other microbial strains many of which are 
deposited and (potentially) available from recognized 
microorganism depository institutions, such as the American 
Type Culture Collection (ATCC)— cf the ATCC catalogue 
listing, See also German Offenlegungschrift 2644432. These 
other microorganisms include, for example. Bacilli such as 
Bacillus subtilis and other enterobacteriaceae among which 
can be mentioned as examples Salmonella typhimurium and 
Serratia marcesans, utilizing plasmids that can replicate and 
express heterologous gene sequences therein. Yeast, such as 
Saccharomyces cerevisiae, may also be employed to advan- 
tage as host organism in the preparation of the interferon 
proteins hereof by expression of genes coding therefor under 
the control of a yeast promoter. 
MATERIALS AND METHODS 

General methods. 
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Restriction enzymes were purchased from New England 
Biolabs and used as directed. Plasmid DNA was prepared by 
a standard cleared lysate procedure (8) and purified by 
column chromatography on Biogel A-50M (Bio-Rad). DNA 
sequencing was performed using the method of Maxam and 5 
Gilbert (9). DNA restriction fragments were isolated from 
polyacrylamide gels by electroelution. DNA fragments were 
radiolabeled for use as hybridization probes by the random 
calf thymus DNA priming procedure of Taylor et al. (10). In 
sito colony hybridizations were performed by the Grunstein- lo 
Hogness procedure (11). 

Chemical synthesis of deoxyoligonucleotides. 

The deoxyoligonucleotides were synthesized by the 
modified phosphotriestcr method in solution (12), using 
trideoxynucleotides as building block (13). The material and 15 
general procedures were similar to those described (14). The 
six pools of primers (Fib 1-6) containing four dodecanucle- 
otides each were obtained by separately coupling two hex- 
amer pools (of two different 5'-terminal sequences each) 
with three different hexamer pools (of two different 3'-ter- 20 
minal sequences each). 

Induction of fibroblasts. 

Human fibroblasts (cell line GM-2504A) were grown as 
described previously (15). Growth medium (Eagles's mini- 
mal essential medium containing 10 percent fetal calf 25 
serum) was removed from roller bottles (Coming, 850 cm^) 
and replaced with 50 ml growth medium containing 50 
|ig/ml of poly (I):poly (C) (PL Biochemicals) and 10 (ig/ml 
cycloheximide. This induction medium was removed after 4 
hours at 37° C. and cell monolayers were washed with PBS 30 
(0.14M NaCl, 3 mM KCl, 1.5 mM KH2PO4, 8 mM 
Na2HP04). Each bottle was incubated at 37° C. with 10 ml 
of a trypsin-EDTA solution (Gibco 610-5305) until cells 
were detached, and fetal calf serum was added to a concen- 
tration of 10 percent. Cells were spun for 15 minutes at 500x 35 
g and pellets were resuspended in PBS, pooled, and resedi- 
mented. Cells were frozen in liquid nitrogen. Approximately 
0.17 g of cells were obtained per roller bottle. 

Preparation and assay of interferon mRNA. 

Poly extraction and oligo(dT)-ceIlulose chromatography 40 
as described elsewhere (16). The poly (A) containing RNA 
was enriched for interferon mRNA by centrifugation on a 
linear 5 percent to 20 percent (w/v) sucrose gradient. The 
RNA samples were heated to 80° C. for 2 minutes, rapidly 
cooled, layered over the gradient, and centrifuged for 20 45 
hours at 30,000 rpm at 4° C. in a Beckman SW-40 rotor. 
Fractions were collected, ethanol precipitated, and dissolved 
in HjO. 

One microgram samples of mRNA were injected into 
Xenopus laevis oocytes as described previously (17,18 ). 50 
The injected oocytes were incubated 24 hours at 21° C, 
homogenized, and centrifuged for 5 minutes at 10,000x g. 
The interferon in the supernatant was determined by the 
cytopathic effect (CPE) inhibition assay (1) using Sindbis 
virus and human diploid (WISH) cells. Interferon titers of 55 
1,000 to 6,000 units recovered (NTH reference standard) per 
microgram of RNA injected were routinely obtained for the 
12s species of mRNA. 

Synthesis and cloning of cDNA. 

Single stranded cDNA was prepared in 100 pi reactions 60 
containing 5 pg of 12S fraction mRNA, 20 mM Tris-HCl 
(pH 8.3), 20 mM KCl, 8 mM MgCl^, 30 mM P-mercapto- 
ethanol, 100 pCi of (a^^p)dCTP (Amersham) and 1 mM 
dATP, dCTP, dGTP, dTTR The primer was the synthetic 
Hind III decamer dCCAAGCTTGG (19), which had been 65 
extended at the 3' terminus with about 20 to 30 deoxythy- 
midine residues using terminal deoxynucleotidyl transferase 



(20). 1 00 units of AMV reverse transcriptase were added and 
the reaction mixture was incubated at 42° C. for 30 minutes. 
The second strand DNA synthesis was carried out as 
described previously (21). The double stranded cDNA was 
treated with 1200 units of SI nuclease (Miles Laboratories) 
for 2 hours at 37° C. in 25 mM sodium acetate (pH 4.5), 1 
mM ZnClz, 0.3M NaCl. After phenol extraction the mixture 
was separated electrophoreticilly on an 8 percent polyacry- 
lamide gel. cDNA ("0.5 pg) ranging from 500 to 1500 base 
pairs in size was recovered by electroelution. A 20 ng aliquot 
was extended with deoxyC residues using terminal deoxy- 
nucleotidyl transferase (20), and annealed with 100 ng of 
pBR322 which had been cleaved with Pst I and tailed with 
deoxyG residues (20). The annealed mixture was used to 
transform E. coli k-12 strain 294 (22) by a published 
procedure (23). Strain 294 was used throughout in the work 
described here, and has been deposited with the American 
Type Culmre collection, accession no. 31446. 

Preparation of induced and uninduced '^p-cDNA probes. 
5 pg of 12s mRNA were combined with either 2 pg of oligo 
(dT)i2.i8 (Collaborative Research) or5 pg of each synthetic 
primer pool (Fib 1 to Fib 6) in 60 pi of 10 mM Tris-HCl (ph 
8), 1 mM EDTA. The mixtures were boiled 3 minutes, and 
quenched on ice. 60 pi of 40 mM Tris-HCl (pH 8.3), 40 mM 
KCl, 16 mM MgCl2,60 mM P-mercaptoethanol, 1 mM 
dATP, dGTP, dTTP and SxlQ-^M (a-^^p) dCTP (Amersham, 
2,000-3,000 Ci/mmole) was added to each template-primer 
mix at 0° C. After the addition of 100 units of AMV reverse 
transcriptase, the reactions were incubated at 42° C. for 30 
minutes and purified by passage over 10 ml Sephadex G-50 
columns. The products were treated with 0.3N NaOH for 30 
minutes at 70° C, neutralized, and ethanol precipitated. 

The ^^p-cDNAs were combined with 100 pg of poly (A) 
mRNA from uninduced fibroblasts ui 50 pi of 0.4M sodium 
phosphate (pH6.8), 0.1 percent SDS. The mixtures were 
heated at 98° C. for 5 minutes and allowed to anneal 15 
hours at 45° C. The DNA-RNA hybrids, (containing unin- 
duced cDNA sequences) were separated from single- 
stranded DNA (induced cDNA sequences) by chromatogra- 
phy on hydroxyapatite as described by Galau et al. (24). The 
DNA-RNA hybrids were treated with alkali to remove RNA. 

Screening of recombinant plasmids with ^^p-cDNA 
probes. 

Approximately 1 pg samples of plasmid DNA were 
prepared from individual transformants by a published pro- 
cedure (25). The DNA samples were linearized by digestion 
with Eco RI, denatured in alkali, and applied to each of three 
nitrocellulose filters (Schleicher and Schuell, BA85) by the 
dot hybridization procedure (26). The filters were hybridized 
with the ^^p-cDNA probes for 16 hours at 42° C. in 50 
percent formamide, lOx Denhardt's solution (27), 6xSSC, 
40 mM Tris-HCl (pH 7.5), 2 mM EDTA, 40 pg/ml yeast 
RNA. Filters were washed witii O.lxSSC, 0.1 percent SDS 
twice for 30' at 42° C, dried, and exposed to Kodak XR-2 
x-ray film using Dupont Lightning-Plus intensifying screens 
at -80° C. 

Construction of plasmids for direct expression of FIR 
The synthetic primers I (dATGAGCTACAAC) and n 
(dCATGAGCTACAAC) were phosphorylated using T4 
polynucleotide kinase and (y-^^.P)ATP (Amersham) to a 
specific activity of 700 Ci/mmole as described previously 
(28). Primer repair reactions were performed as follows: 250 
pmoles of the ^^P-primers were combined with 8 pg (10 
pmole) of a 1200 bp Hha I restriction fragment containing 
the FIF cDNA sequence. The mixture was ethanol precipi- 
tated, resuspended in 50 pi H2O, boiled 3 minutes, quenched 
in a dry ice-ethanol bath, and combined with a 50 pi solution 
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of 20 mM Tris-HCl (pH 7.5), 14 mM MgC^, 120 mM NaCl, 
0.5 mM dATP, dCTP, dGTP, dTTP at 0° C. 10 units of DNA 
polymerase I Klenow fragment (Boehringer-Mannheim) 
were added and the mixture was incubated at 37° C. for 4Vi 
hours. Following extraction with phenol/CHCl3 and restric- 5 
tion with Pst I, the desired product was purified on a 6 
percent polyacrylamide gel. Subsequent ligations were done 
at room temperature (cohesive termini) or 4° C. (blunt ends) 
using previously detailed conditions (21,28). 

Assay for interferon expression in E. colt to 
Bacterial extracts were prepared for IF assay as follows: 
One ml cultures were grown overnight in LB (29) containing 
5 ng/ml tetracycline, then diluted into 25 ml of M9 medium 
(29) containing 0.2 percent glucose, 0.5 percent casamino 
acids and 5 pg/ml tetracycline. 10 ml samples were har- 15 
vested by centrifugation when A550 (Absorbance at 500 
nanometers) reached 1.0. The cell pellets were quickly 
frozen in a dry ice-ethanol bath and cleared lysates were 
prepared as described by Clewell (8). Interferon activity in 
the supematants was determined by comparison with NIH 20 
FTP standards using cytopathic eifect (CPE) inhibition 
assays as reviewed previously (1). Two different assays were 
used: (a) WISH (human amnion) cells were seeded in 
microtiter dishes. Samples were added 16 to 20 hours later 
and diluted by serial 2-foId dilution. Sindbts virus was added Z5 
after at least 3 hours in incubation. Plates were stained 20 to 
24 hours later with crystal violet, (b) MDBK (bovine kid- 
ney) cell line was seeded simultaneously with 2-fold dilu- 
tions of samples. Vesicular stomatitis vims was added after 
2 to 3 hours incubation and plates were stained with crystal 30 
violet 16 to 18 hours later. To test pH 2 stability bacterial 
extracts and standards were diluted in minimal essential 
medium to a concentration of 1 000 units/ml. One ml aliquots 
were adjusted to pH 2 with IN HCl, incubated at 4° C. for 
16 hours, and neutralized by addition of NaOH. IF activity 35 
was determined by the CPE inhibition assay using human 
amnion cells. To establish antigenic identity 25 |il aliquots of 
the 1000 U/tnl interferon samples (untreated) were incu- 
bated with 25 |il of rabbit antihuman leukocyte interferon for 
60' at 37° C, centrifuged at 12,000x g for 5 minutes and the 40 
supernatant assayed. Fibroblast and leukocyte interferon 
standards were obtained from the National Institutes of 
Health. Rabbit antihuman leukocyte interferon was obtained 
from the National Institute of Allergy and Infectious Dis- 
eases. 45 
RESULTS 

Chemical synthesis of primer pools complementary of 
HF mRNA. 

The amino-terminal protein sequence of human fibroblast 
interferon (4) permitted us to deduce the 24 possible mRNA 50 
sequences which could code for the first four amino acids. 
The 24 complementary deoxyoligonucleotides were synthe- 
sized in 6 pools of 4 dodecamers each (FIG. 1). 

The six pools of 4 deoxyoligonucleotides each were 
synthesized by a modified phosphotriester method that has 55 
been used previously for the rapid synthesis of oligonucle- 
otides in solution (12) and on solid phase (14). The basic 
strategy involved reacting two different 3'-blocked trimers 
with an excess of a single 5'-protected trimer to yield a pool 
of two hexamers, each represented equally. The coupling of 50 
two pools, each containing two hexamers, then resulted in a 
pool of four dodecamers. 

Identification of FIF cDNA clones. 

Using 12s mRNA from induced human fibroblasts (1 ,000 
units IF activity per ng in oocyte assay), double stranded 65 
cDNA was prepared and inserted into pBR322 at the Pst I 
site by the standard dG:dC tailing method (20). A fibroblast 
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cDNA library consisting of 30,000 ampicillin-sensitive, tet- 
racycline-resistant transformants of E. coli K-12 strain 294 
was obtained from 20 ng of cDNA ranging in size from 550 
to 1300 base pairs. Plasmid DNA was prepared from 600 of 
the transformants and applied to 3 sets of nitrocellulose 
filters as described in Materials and Methods. 

The approach followed in the identification of hybrid 
plasmids containing fibroblast interferon cDNA sequences 
was similar to that used to identify human leukocyte inter- 
feron recombinant plasmids (30). Radiolabeled cDNA 
hybridization probes were prepared using either the 24 
synthetic dodecamers or oligo(dT)i2.is as primers and 12S 
RNA from induced fibroblasts (5000 units/ng in oocytes) as 
template. The ^^P-cDNAs (specific activity >5xlO*' cpm/|ig) 
obtained were hybridized to a large excess of mRNA iso- 
lated from uninduced human fibroblasts, and the mRNA- 
cDNA hybrids were separated from unreacted cDNA by 
hydroxyapatite chromatography (24). The single stranded 
cDNA fractions should be enriched for sequences which arc 
present in induced fibroblasts but absent in uninduced cells, 
and the mRNA-cDNA hybrids should represent sequences 
common to both induced and uninduced cells. Approxi- 
m,ately 4x10" cpm of single stranded cDNA (hybridization 
probe A) and 8x10'' cpm of cDNA-mRNA hybrids were 
obtained using oligo(dT)i2_jg primed cDNA; 1.5x10" cpm 
of single stranded (hybridization probe B) and 1.5x10^ cpm 
of hybrids were obtained from cDNA primed using synthetic 
dodecamer pools Fib 1-6. The cDNA-mRNA hybrids from 
both fractionations were combined, the RNA hydrolyzed by 
treatment with alkali, and the ^^P-cDNA used as hybridiza- 
tion probe C. Many of the 600 plasmid samples hybridized 
with both probes A and C, indicating that the hybridization 
reactions between uninduced mRNA and ^^P-cDNA (prior 
to the hydroxyapatite fractionation step) had not gone to 
completion. However, only one of the 600 plasmids (pF526) 
hybridized strongly with the specifically primed, induced 
cDNA probe B (FIG. 2). Plasntid pF526 also hybridized 
with the total oligo(dT)i2.i8 primed, induced cDNA probe 
A, and failed to give detectable hybridization to the com- 
bined uninduced probe C. 

Pst I digestion of pF526 showed the cloned cDNA insert 
to be about 550 base pairs long, probably too short to contain 
the entire coding region for a protein the size of fibroblast 
interferon. Therefore, a ^^P-labeled DNA probe was pre- 
pared from this Pst I fragment by random priming with calf 
thymus DNA (10). This probe was used to screen 2000 
individual colonies from a newly constructed fibroblast 
cDNA library (the new cDNA library was prepared using 
12s mRNA from induced fibroblasts having a titer of 6,000 
units/ml in the oocyte assay system). Sixteen clones hybrid- 
ized to the probe. Plasmids prepared from the majority of 
these released two fragments when cleaved with Pst 1, 
indicating that the cDNA contained an internal Pst I site. 
Clone pFIF3 contained the largest cDNA insert, about 800 
base pairs. The DNA sequence of the insert was determined 
by the Maxam-Gilbert procedure (9) and is shown in FIG. 3. 
The amino acid sequence of human fibroblast interferon 
predicted from the nucleotide sequence is identical to that 
reported recently by Taniguchi et al. (31) and by Derynck ot 
al. (7) from DNA sequencing of FIF cDNA clones. A 
precursor or signal peptide of 21 amino acids is followed by 
a mature interferon polypeptide of 166 amino acids, a stretch 
of 196 3'-untranslated nucleotides and a poly(A) tail. The 
NH2-terminal 20 amino acids of mature FIF have been 
directly determined by protein microsequencing and arc the 
same as those predicted from the DNA sequence. The 
calculated formula molecular weight of mature human flbro- 
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blast interferon having the 166 amino acids shown in FIG. 
3 is about 20,027. 

Direct expression of fibroblast interferon. 

To express high levels of mature fibroblast interferon in E. 
coli, initiation of protein synthesis must occur at the ATG 5 
codon of the mature polypeptide (amino acid 1) rather than 
at the ATG of the signal peptide (amino acid SI) (FIG. 3). 

Our approach to removing the signal peptide coding 
regions from pFIF3 is depicted in TIG. 4. A 1200 bp DNA 
fragment which contained the entire cDNA insert was iso- 10 
lated from a polyacrylamide gel after digesting pFIF3 with 
Hha I. Two separate synthetic deoxyoligonucleotide prim- 
ers, dATGAGCTACAACa) and dCATGAGCTACAAC(n), 
were prepared. Both primers contain the coding sequence for 
the first four amino acids of mature fibroblast interferon; 15 
primer 11 has an additional C at the 5'-terminus. Primer 
repair reactions and subsequent ligations were carried out 
separately for primers I and n, and gave nearly identical 
results. Therefore, only reactions using primer I are dis- 
cussed in detail here. The primers were 5'-radiolabeled using 20 
(7-^^P)ATP and T4 polynucleotide kinase, combined with 
the 1200 bp Hha I DNA fragment and the mixture denatured 
by boiling. Following hybridization of the primer to the 
denatured Hha I DNA fragment, E. coli DNA polymerase I 
Klenow fragment (33) was used to catalyze the repair 25 
synthesis of the plus (top) strand (FIG. 4). In addition, the 
associated 3' §5' exonuclease activity of the Klenow frag- 
ment removed the 3 '-protruding end from the minus (bot- 
tom) strand, leaving a flush end. Analysis of samples of the 
reaction mixture by polyacrylamide gel electrophoresis indi- 30 
cated that the repair synthesis did not go to completion, but 
stopped at several discrete sites. Therefore, the entire reac- 
tion mixture was treated with Pst I and the desired 141 bp 
fragment (180,000 Cerenkov cpm; "0.3 pmole) was purified 
by polyacrylamide gel electrophoresis (FIG. 5). Ligation of 35 
this fragment to 1 |ig (~4 pmole) of the 363 bp Pst I-Bgl n 
fragment isolated from pFFFS (FIG. 4), followed by Bgl n 
digestion, yielded 50,000 Cerenkov cpm (~0.1 pmole, ~30 
ng) of the 504 bp DNA fragment containing the entire 
coding sequence for mature fibroblast interferon. The same 40 
reactions using primerll gave 83,000 cpm ("0.15 pmole, ~50 
ng) of 505 bp product. 

The constmction of plasmids which direct the synthesis of 
human fibroblast interferon is outlined in FIG. 6. Separate 
expression plasmids were constructed which placed FIF 45 
synthesis under the control of the E. coli lac or trp promoter- 
operator systems. Both of these systems have proven useful 
for the direct expression of eukaryotic genes in E. coli: 
human growth hormone has been efficiently synthesized 
using the lac system (21) and human leukocyte interferon 50 
has been produced at high levels using the trp system (30) 
and Nature 287, 411 (1980). 

pBRH trp was digested with EcoRI restriction enzyme 
and the resulting fragment isolated by PAGE and electro- 
elution. EcoRI-digested plasmid pSom 11 (K. Itakura et al., 55 
Science 198, 1056 (1977); G.B. patent publication no. 2 007 
676 A) was combined with the above fragment. The mixture 
was ligated with T4 DNA ligase as previously described and 
the resulting DNA transformed into E. coli K-12 strain 294 
as previously described. Transformant bacteria were 60 
selected on ampicillin-containing plates. Resulting ampicil- 
lin-resistant colonies were screened by colony hybridization 
(M. Gmenstein et al., Proc Nat'l Acad Sci USA 72, 
3951-3965 [1975]) using as a probe the trp promoter- 
operator-containing the above fragment isolated from 65 
pBRHtrp, which had been radioactively labelled with p^^. 
Several colonies shown positive by colony hybridization 



were selected, plasmid DNA was isolated and the orientation 
of the inserted fragments determined by restriction analysis 
employing restriction enzymes Bglll and BamHI in double 
digestion. E. coli 294 containing the plasmid designated 
pSOM7A2, which has the trp promoter-operator fragment in 
the desired orientation was grown in LB medium containing 
10 |ig/ml ampicillin. The cells were grown to optical density 
1 (at 550 nM), collected by centrifugation and resuspended 
in M9 media in tenfold dilution. Cells were grown for 2-3 
hours, again to optical density 1, then lysed and total cellular 
protein analyzed by SDS (sodium dodccyl sulfate) urea (15 
percent) polyacrylamide gel electi-ophoresis (J. V. Maizel Jr 
et al., Meth Viral 5, 180-246 [1971]). 

Plasmid pBR322 was Hind III digested and the protruding 
Hind ni ends in turn digested with SI nuclease. The SI 
nuclease digestion involved treatment of 10 \ig of Hind 
m-cleaved pBR322 in 30 |j1 SI buffer (0.3M NaCl, 1 mM 
ZnClj, 25 mM sodium acetate, pH 4.5) with 300 units SI 
nuclease for 30 minutes at 15° C. The reaction was stopped 
by the addition of 1 ^1 of 30xSl nuclease stop solution 
(0.8M tiis base, 50 mM EDTA). The mixture was phenol 
extracted, chloroform extracted and ethanol precipitated, 
then EcoRI digested as previously described and the large 
fragment (1) obtained by PAGE procedure followed by 
electroelution. The fragment obtained has a first EcoRI 
sticky end and a second, blunt end whose coding strand 
begins with the nucleotide thymidine. 

Plasmid pSom7 A2, as prepared above, was Bgl II 
digested and the Bgl II sticky ends resulting made double 
stranded with the Klenow polymerase I procedure using all 
four deoxynucleotide triphosphates. EcoRI cleavage of the 
resulting product followed by PAGE and electroelution of 
the small fragment (2) yielded a Unear piece of DNA 
containing the tryptophan promoter-operator and codons of 
the LE' "Proximal" sequence upstream from the Bgl n site 
("LE'(p)"). The product had an EcoRI end and a blunt end 
resulting from filling in the Bgl II site. However, the Bgl 11 
site is reconstituted by ligation of the blunt end of the above 
fragment (2) to the blunt end of the above prepared fragment 
(1). Thus, the two fragments were ligated in the presence of 
T4 DNA ligase to form the recirculated plasmid pHKY 10 
which was propagated by tiransformation into competent E. 
coli strain 294 cells. 

Plasmid pGMl carries the E. coli tryptophan operon 
containing the deletion ALE1413 (G. F. Miozzari, et al., 
(1978) J. Bacteriology 133, 1457-1466)) and hence 
expresses a fusion protein comprising the first 6 amino acids 
of the trp leader and approximately the last third of the trp 
E polypeptide (hereinafter referred to in conjunction as LE'), 
as well as the trp D polypeptide in its entirety, all under the 
control of the trp promoter-operator system. The plasmid, 20 
|ig, was digested with the restriction enzyme PvuII which 
cleaves the plasmid at five sites. The gene fragments were 
next combined with EcoRI linkers (consisting of a self 
complementary oligonucleotide of the sequence: pCAT- 
GAATTCATG) providing an EcoRI cleavage site for a later 
cloning into a plasmid containing an EcoRI site. The 20 \ig 
of DNA fragments obtained from pGMl were treated with 
10 units T4 DNA ligase in the presence of 200 pico moles of 
the 5'-phosphorylated synthetic oUgonucleotide pCAT- 
GAATCATG and in 20 nl T4 DNA Ugase bufi'er (20 mM tiis, 
pH 7.6, 0.5 mM ATP, 10 mM MgQa, 5 mM dithiothreitol) 
at 4° C. overnight. The solution was then heated 10 minutes 
at 70° C. to halt ligation. The linkers were cleaved by EcoRI 
digestion and the fragments, now with EcoRI ends were 
separated using 5 percent polyacrylamide gel electrophore- 
sis (hereinafter "PAGE") and the three largest fragments 
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isolated from the gel by first staining with ethidium bromide, 
locating the fragments with ultraviolet light, and cutting 
from the gel the portions of interest. Each gel fragment, with 
300 microHters O.lxTBE, was placed in a dialysis bag and 
subjected to electrophoresis at 100 v for one hour in 0.1 x 5 
TBE buffer (TBE buffer contains: 10.8 gra tris base, 5.5 gm 
boric acid, 0.09 gm NajEDTA in 1 liter HjO). The aqueous 
solution was collected from the dialysis bag, phenol 
extracted, chloroform extracted and made 0.2M sodium 
chloride, and the DNA recovered in water after ethanol lo 
precipitation. The trp promotcr-opcrator-containing gene 
with EcoRI sticky ends was identified in the procedure next 
described, which entails the insertion of fragments into a 
tetracycline sensitive plasmid which, upon promoter-opera- 
tor insertion, becomes tetracycline resistant. 15 

Plasmid pBRHl (R. I. Rodriguez, et al. Nucleic Acids 
Research 6, 3267-3287 [1979]) expresses ampicilin resis- 
tance and contains the gene for tetracycline resistance but, 
there being no associated promoter, does not express that 
resistance. The plasmid is accordingly tetracycline sensitive. 20 
By introducing a promoter-operator system in the EcoRI 
site, the plasmid can be made tetracycline resistant. 

pBRHl was digested with EcoRI and the enzyme 
removed by phenol extraction followed by chloroform 
extraction and recovered in water after ethanol precipitation. 25 
The resulting DNA molecule was, in separate reaction 
mixtures, combined with each of the three DNA fragments 
obtained above and ligated with T4 DNA ligase as previ- 
ously described. The DNA present in the reaction mixture 
was used to transform competent E. coli K-12 strain 294, K. 30 
Backman et al., Proc Nat'l Acad Sci USA 73, 4174-4198 
[1976]) by standard techniques (V. Hershfield et al., Proc 
Nat' 1 Acad Sci US A 7 1 , 3455-3459 [ 1 974]) and the bacteria 
plated on LB plates containing 20 |ig/ml ampiciUin and 5 
|.ig/ml tetracycline. Several tectracycline-resistant colonies 35 
were selected, plasmid DNA isolated and the presence of the 
desired fragment confirmed by restriction enzyme analysis. 
The resulting plasmid is designated pBRHtrp. 

An EcoRI and BamHI digestion product of the viral 
genome of hepatitis B was obtained by conventional means 40 
and cloned into the EcoRI and BamHI sites of plasmid 
pGII6 (D. V. Goeddel et al.. Nature 281, 544 [1979])) to 
form the plasmid pHS32. Plasmid pHS32 was cleaved with 
Xbal, phenol extracted, chloroform extracted and ethanol 
precipitated. It was then treated with 1 |.il E. coli polymerase 45 
I, Klenow fragment (Boehringer-Mannheim) in 30 |4l poly- 
merase buffer (50 mM potas.sium phosphate pH 7.4, 7 mM 
MgCl2, 1 mM p-mercaptoethanol) containing 0. 1 mM dTTP 
and 0.1 mM dCTP for 30 minutes at 0° C. then 2 hr. at 37° 
C. This treatment causes 2 of the 4 nucleotides complemen- 50 
tary to the 5' protruding end of the Xbal cleavage site to be 
filled in: t,0170 

Two nucleotides, dC and dT, were incorporated giving an 
end with two 5' protruding nucleotides. This linear residue 
of plasmid pHS32 (after phenol and chloroform extraction 55 
and recovery in water after ethanol precipitation) was 
cleaved with EcoRI. The large plasmid fragment was sepa- 
rated from the smaller EcoRI-Xbal fragment by PAGE and 
isolated after electroelution. This DNA fragment from 
pHS32 (0.2 fjg), was ligated, under conditions similar to 60 
those described above, to the EcoRI-Taq I fragment of the 
tryptophan operon (~0.01 |ig), derived from pBRHtrp. 

In the process of ligating the fragment from pHS32 to the 
Eco RI-Taq I fragment,.. as described above, the Taq I 
protruding end is ligated to the Xbal remaining protruding 65 
end even though it is not completely Watson-Crick base- 
paired: t,0180 
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A portion of this ligation reaction mixture was trans- 
formed into E. coli 294 cells, heat treated and plated on LB 
plates containing ampicillin. Twenty-four colonies were 
selected, grown in 3 ml LB media, and plasmid isolated. Six 
of these were found to have Xbal site regenerated via E. coli 
catalyzed DNA repair and replication: t,0181 

These plasmids were also found to cleave both with 
EcoRI and Hpal and to give the expected restriction frag- 
ments. One plasmid, designated pTrp 14, was used for 
expression of heterologous polypeptides, as next discussed. 

The plasmid pHGH 107 (D. V. Goeddel et al. Nature, 281 , 
544, 1979) contains a gene for human growth hormone made 
up of 23 amino acid codons produced from synthetic DNA 
fragments and 163 amino acid codons obtained from 
complementary DNA produced via reverse transcription of 
human growth hormone messenger RNA. This gene, though 
it lacks the codons of the "pre" sequence of human growth 
hormone, does contain an ATG translation initiation codon. 
The gene was isolated from 10 jig pHGH 107 after treatment 
with EcoRI followed by E. coli polymerase I Klenow 
fragment and dTTP and dATP as described above. Following 
phenol and chloroform extraction and ethanol precipitation 
the plasmid was treated with BamHI. 

The human growth hormone ("HGH") gene-containing 
fragment was isolated by PAGE followed by electroelution. 
The resulting DNA fragment also contains the first 350 
nucleotides of the tetracycline resistance structural gene, but 
lacks the tetracycline promoter-operator system so that, 
when subsequentiy cloned into an expression plasmid, plas- 
inids containing the insert can be located by the restoration 
of tetracycline resistance. Because the EcoRI end of the 
fragment has been filled in by the Klenow polymerase 1 
procedure, the fragment has one blunt and one sticky end, 
ensuring proper orientation when later inserted into an 
expression plasmid. 

The expression plasmid pTrpl4 was next prepared to 
receive the HGH gene-containing fragment prepared above. 
Thus, pTrpl4 was Xbal digest and the resulting sticky ends 
filled in with the Klenow polymerase I procedure employing 
dATP, dTTP, dGTP and dCTR After phenol and chloroform 
extraction and ethanol precipitation the resulting DNA was 
treated with BamHI and the resulting large plasmid fragment 
isolated by PAGE and electroelution. The pTrpl4-dcrivcQ 
fragment had one blunt and one sticky end, permitting 
recombination in proper orientation with the HGH gene 
containing fragment previously described. 

The HGH gene fragment and the pTrpl4 AXba-BamHl 
fragment were combined and ligated together under condi- 
tions similar to those described above. The filled in Xbal and 
EcoRI ends ligated together by blunt end ligation to recreate 
both the Xbal and the EcoRI site: t,0190 

This construction also recreates the tetracycline resistance 
gene. Since the plasmid pHGH 107 expresses tetracycline 
resistance from a promoter lying upstream from the HGH 
gene (the lac promoter), this construction, designated pHGH 
207, permits expression of the gene for tetracycline resis- 
tance under the control of the tryptophan promoter-operator. 
Thus the ligation mixture was transformed into E. coli 294 
and colonies selected on LB plates containing 5 |Jg/ml 
tetracycline. 

Plasmid pHGH 207 was EcoRI digested and the trp 
promoter containing EcoRI fragment recovered by PAGE 
followed by electroelution. Plasmid pBRHl was EcoRI 
digested and the cleaved ends treated with bacterial alkaline 
phosphatase ("BAP") (1 |ig, in 50 mM tris pH 8 and 1 0 mM 
MgClj for 30 min. at 65° C.) to remove the phosphate 
groups on the protruding EcoRI ends. Excess bacterial 
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alkaline phosphatase was removed by phenol extraction, 
chloroform extraction and ethanol precipitation. The result- 
ing linear DNA, because it lacks phosphates on the protrud- 
ing ends thereof, will in ligation accept only inserts whose 
complementary stick ends are phosphorylated but wOl not 5 
itself recircularize, permitting more facile screening for 
plasmids containing the inserts. 

The EcoRI fragment derived from pHGH 207 and the 
linear DNA obtained from pBRHl were combined in the 
presence of T4 ligase as previously described and ligated. A 
portion of the resulting mixture was transformed into E. coli 
strain 294 as previously described, plated on LB media 
containing 5 ng/ml of tetracycline, and 12 tetracycline 
resistant colonies selected. Plasmid was isolated from each 
colony and examined for the presence of a DNA insert by 
restriction endonuclease analysis employing EcoRI and 
Xbal. One plasmid containing the insert was designated 
pHKYl. 

The plasmid pHKYlO, described above, is a derivative of 
pBR322 which contains a Bgl II site between the tetracy- 
cline resistance (Tc^) promoter and structural gene (32). The 20 
large DNA fragment isolated after digesting pHKYlO with 
Pst 1 and Bgl 11 therefore contains part of the ampicillin 
resistance (Ap^) gene and all of the Tc^ structural gene, but 
lacks the Tc^ promoter (HG. 6). The plasmid pGH6 (21) 
was digested with Eco RI, the resulting single stranded ends 25 
were filled in with DNA polymerase I, and the plasmid was 
cleaved with Pst I. The small fragment, containing part of 
the Ap'^ gene, a double lac promoter and lac ribosome 
binding site, but lacking an ATG initiation triplet was 
isolated. A similar trp promoter fragment, containing the trp 30 
leader ribosome binding site, but lacking an ATG sequence 
(30), may be isolated from pHKYl described above; see 
(32) (see HG. 6). 

The trp fragment just referred to is an analog of the E. coli 
tryptophan operon from which the so-called trp attenuator 35 
has been deleted, See J. Bad. 133, 1457 (1978), to control- 
lably heighten expression levels. Expression plasmids con- 
taining the modified trp regulon can be grown to predeter- 
mined levels in nutrient media containing additive 
tryptophan in quantities suiEcient to repress the promoter- 40 
operator system, then be deprived of tryptophan so as to 
derepress the system and occasion the expression of the 
intended product. 

The expression plasmids may be assembled via three part 
ligation reactions as shown in FIG. 6. 15 ng ("0.05 pmole) 45 
of the assembled HP gene (504 or 505 bp), 0.5 ng ("0.2 
pmole) of the large Pst I-Bgl H fragment of pHKYlO and 0.2 
pg ("0.3 pmole) of the appropriate promoter fragment were 
ligated and the mixture used to transform E. coli 294 (22). 
Plasmid DNA was prepared from individual transformants 50 
and analyzed by restriction mapping. Correct joining of the 
assembled gene to the promoter fragment should restore the 
Eco RI (lac) or the Xba I (trp) recognition sequences. The 
majority of the plasmids gave the expected restriction 
enzyme digestion patterns. Individual clones (12 containing 55 
the trp promoter and 12 containing the lac promoter) were 
grown and extracts prepared for interferon assay as 
described in Materials and Methods. 

When assayed on human amnion (WISH) cells for anti- 
viral activity by the CPE inhibition assay (1) five of the trp 60 
transformants were positive (each approximately equiva- 
lent); eleven of the lac transformants gave equivalent IF 
activities. Therefore, one transformant from each series 
(pFIFlac9 and pFIFtrp69) was selected for further study 
(Table 1). DNA sequence analysis demonstiated that the 65 
desired attachment of promoter to FIF structural gene had 
occurred in both cases. t,0210 



The amounts of fibroblast interferon produced by 
pFIFlac9 and pFIFtrp69 are shown in Table 1. The trp 
promoter gave a ITF expression level measurably higher 
than did the lac promoter. In an attempt to further increase 
FIF expression levels, pFIFtrp69 was cleaved with Eco RI 
and two 300 base pair Eco RI fragments containing the trp 
promoter (30) were inserted. The resulting plasmid, 
pFIFtrp^69, contains three successive trp promoters which 
read toward the FIF gene. The amount of FIF synthesized by 
E. coli K-12 sti-ain 294/pFIF tip^69 is 4-5 times that 
produced by pFIF trp 69 (Table 1). This is apparently due to 
the derepression of the trp promoter which occurs when trp 
repressor levels are titrated by the multiple copies of the trp 
operator. 

The FIF produced by E. coli K-12 stirain 294/pFIFtTp69 
behaves like authentic human FIF. As shown in Table 2, its 
antiviral activity is about 30 times greater on human cells 
than bovine cells. In addition, the bacterially produced FIF 
is stable to treatment at pH 2 overnight and is not neuttalized 
by rabbit antihuman leukocyte interferon antibodies (Table 
3). t,0220 t,0221 
Purification 

The purification procedure for bacterial derived fibroblast 
is as follows: 

1. Frozen cells are suspended in twelve times volume per 
weight with sucrose lysis buffer (100 mM Tris-HCl, 10 
percent sucrose, 0.2M NaCl, 50 mM EDTA, 0.2 mM 
PMSF, pH 7.9) containing lysozyme at 1 mg mr\ The 
cell suspension is stirred for 1 hour at 4° C. and centri- 
fuged. Fibroblast interferon activity remains in the super- 
natant. 

2. Polyethyleneimine (5 percent v/v) is added to the soni- 
cated supernatant to a final concentration of 0.5 percent 
(v/v). The solution is stirred for 1 hour at 4° C. and 
centrifuged. Interferon activity remains in the superna- 
tant. 

3. Solid ammonium sulfate is added to the polyethylene- 
imine supernatant to a final concentration of 50 percent 
saturation, stirred for 30 minutes at 4° C. and centrifuged. 
Interferon activity is in the 50 percent pellet. 

4. The 50 percent ammonium sulfate pellet is suspended in 
one half the volume of the 50 percent ammoirium sulfate 
suspension with Phosphate Buffered Saline (20 mM 
sodium phosphate 0.15M NaCl, pH 7.4). Polyethylene 
glycol 6000 (50 percent w/v in PBS) is added to a final 
concentration of \2Vi percent (v/v), stirred at 4° C. for 2 
hours and centrifuged. Interferon activity is in the pellet. 
The pellet is suspended in a miirimal volume of sucrose 
lysis buffer and clarified by centiifugation. 

This initial extraction procedure results in a purification of 
fibroblast interferon from 0.001 percent of the total protein 
to 0.05 percent of the total protein. This material can be 
further purified to homogeneity by the following column 
chromatography steps: 

5. Afinity chromatography on Amicon Blue B in sucrose 
lysis buifer. 

6. Anion exchange chromatography on QAE Sephadex in 
sucrose lysis buffer in the absence of 0.2M NaCl. 

7. Size exclusion chromatography on Sephadex G-75 in 
sucrose lysis buffer. 

8. Reverse phase high pressure liquid chromatography. 
Parenteral Administration 

FIF may be parenterally administered to subjects requir- 
ing antitumor or antiviral tireatment. Dosage and dose rate 
may parallel that currently in use in clinical investigations of 
human derived materials, e.g., about (l-lO)xlO'' units daily, 
and in the case of materials of purity greater than 1 percent- 
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age, likely up to, e.g., 150x10*^ units daily. Dosages of 
bacterially obtained FIF could be significantly elevated for 
greater effect owning to the essential absence of human 
proteins other than FTP, which proteins in flbroblast-derived 
materials may act as pyrogens, exhibiting adverse effects, 5 
e.g., malaise, temperature elevation, etc. 

As one example of an appropriate dosage form for essen- 
tially homogeneous bacterial FIF in parenteral form, 3 mg. 
FIF of specific activity of, say, 2x10** p/mg may be dissolved 
in 25 ml. 5 percentage serum albumin (human) - USP, the 10 
solution passed through a bacteriological filter and the 
filtered solution aspetically subdivided into 100 vials, each 
containing 6x10^ units pure interferon suitable for parenteral 
administration. The vials are preferably stored in the cold 
(-20° C.) prior to use. 15 

The compounds of the present invention can be formu- 
lated according to known methods to prepare pharmaceuti- 
cally useful compositions, whereby the polypeptide hereof is 
combined in admixture with a pharmaceutically acceptable 
carrier vehicle. Suitable vehicles and their formulation are 20 
described in Remington' s Pharmaceutical Sciences by E. W. 
Martin, which is hereby incorporated by reference. Such 
compositions will contain an effective amount of the inter- 
feron protein hereof together with a suitable amount of 
vehicle in order to prepare pharmaceutically acceptable 25 
compositions suitable for effective administration to the 
host. One preferred mode of administration is parenteral. 
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We claim: 

I. A composition comprising water and a nonglycosylated 
polypeptide having the amino acid sequence of a mature 
human fibroblast interferon, said nonglycosylated polypep- 
tide having a total of 165 or 166 amino acids and said 
composition being free of any glycosylated human fibroblast 
interferon. 

2. The composition of claim 1, said nonglycosylated 
polypeptide having the amino acid sequence 

X-Ser-Tyr-Asn-Leu-Leu-Gly-Phe-Leu-Gln-Arg-Ser-Ser- 
Asn-Phe-Gln-Cys-Gln-Lys-Leu-Leu-Trp-Gln-Leu- 
Asn-Gly-Arg-Leu-Glu-Tyr-Cys-Leu-Lys-Asp-Arg- 
Met-Asn-Phe-Asp-Ile-Pro-Glu-Glu-IIe-Lys-Gln-Lcu- 
Gln-Gln-Phe-Gln-Lys-Glu-Asp-Ala-Ala-Leu-Tlir-Ilc- 
Tyr-Glu-Met-Leu-Gln-Asn-Ile-Phe-Ala-Ile-Phc-Arg- 
Gln-Asp-Ser-Ser-Ser-Thr-Gly-Trp-Asn-Glu-Thr-Ile- 
Val-Glu-Asn-Leu-Leu-Ala-Asn-Val-Tyr-His-Gln-llc- 
Asn-His-Leu-Lys-Thr-Val-Leu-Glu-Glu-Lys-Leu-Glu- 
Lys-Glu-Asp-Phe-Thr-Arg-Gly-Lys-Leu-Mct-Ser-Scr- 
Leu-His-Leu-Lys-Arg-Tyr-Tyr-Gly-Arg-Ile-Leu-His- 
Tyr-Leu-Lys-Ala-Lys-Glu-Tyr-Ser-His-Cys-Ala-Trp- 
Thr-Ile-Val-Arg-Val-Glu-Ile-Leu-Arg-Asn-Phe-Tyr- 
Phe-Ile-Asn-Arg-Leu-Thr-GIy-Tyr-Leu-Arg-Asn, 
wherein X is H or Mel. 

3. The composition of claim 2, said nonglycosylated 
polypeptide having a formula molecular weight of about 
20,027. 
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4. The composition of claim 1, 2 or 3, said composition 
being free of human proteins. 

5. The composition of claim 1, 2 or 3, said composition 
containing a therapeutically eifective amount of said nong- 
lycosylated polypeptide and being suitable for parenteral 
administration. 
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18 

6. The composition of claim 4, said composition contain- 
ing a therapeutically effective amount of said nonglycosy- 
lated polypeptide and being suitable for parenteral admin- 
istration. 
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Table 1. 



Interferon activity in extracts of E. col i 



E. coll K-12 
strain 294 
transformed by; 



Cell density 
(cens/ml) 



IF Activity 
(units/1 culture) 



FIF molecules 
per eel 1 



pBR322 

pFIFlac9 

pFIFtrp69 



3.5 X 108 
3.5 X 108 
3.5 X 108 



pFIFtr£359 3.5 x lO^ 



9.0 X 106 
1.8 X 107 

8.1 X 107 



2,250 
4,500 
20,200 
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Table 2. Interferon activities measured on different eel] types 





Interfere 


n Activity 


(uni ts/ml ) 


Cells 


LeIF 


FIF 


E. coli K-12 strain 
794/pFIFtrp69 extract 


Human amnion 


20,000 


10,000 


1280 


Bovine kidney 


13,000 


400 


40 



LeIF and FIF were UIH standard solutions having 20,000 units/ml and 
10,000 units/ml respectively. Assays were performed as described in 
Materials and Methods. 
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Table 3 . Conpan'son of activities of extracts from E. coT i K-12 

strain 294/pFIFtrp69 v/ith standard human leukocyte and 
fibroblast interferons 



Interferon Activity (units/ml) 
LeIF 



FIF 



E. coH K-12 strain 
7947pn'Ftrp69 



untreated 


1000 


1000 


1000 


pfl2 


1000 


1000 


1000 


rabbit antihuman 


<16 


1000 


1000 


LeIF antibodies 
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Human leukocyte and fibroblast interferons 
are structurally related 

Tadatsugu Taniguchi*, Ned Manteif, Marco Schwarzsteint, Shigekazu Nagataf, 
Masami Muramatsu* & Charles Weissmannt 

•Department of Biochemistry, Cancer Institute, Japanese Foundation for Cancer Research, Tokyo 170, Japan 
jlnstitut fijr Molekularbiologie I, Universitat Zurich, 8093 Zurich, Switzerland 



The coding sequences of the cDNAs of cloned human leukocyte interferon I and human fibroblast interferon show 
homologies of 45% at the nucleotide and 29% at the amino acid level. We conclude that the two genes were derived 
from a common ancestor. 



The acid-stable human interferons are subdivided into two 
major groups, fibroblast interferons (F-IF) and leukocyte 
interferons (Le-IF); these are the major components of the 
interferons produced by induced fibroblasts and leukocytes, 
respectively. Some cells, such as the lymphoblastoid Namalva 
cell line, produce a mixture of 90% Le-IF and 10% F-IF'-^. 
The two interferon types have several features in common: 
both are glycoproteins with molecular weights ranging from 
16,000 to 26,000 (refs 3-9), the induction and shut-off of their 
synthesis seem, to be under similar control*, and at least some 
of the responses elicited in target cells are similar, such as 
induction of an antiviral state, which is accompanied by 
increased synthesis of several proteins'""'^. Nonetheless, the 
two kinds of interferon differ in many respects. Antibodies 
directed against Le-IF do not neutralize F-IF and vice versa'", 
the target cell specificities of the two interferons dilTer", and 
the sequences of the 13 amino-terminal amino acids of F-IF 
and Le-IF (from lymphoblastoid cells) show no 
homology'*'". Although Le-IF and F-IF are encoded by 
different mRNA species'", it is not known whether these 
mRNAs are transcribed from distinct genes or whether they 
arise from the same gene through a common precursor which 
is processed or spliced in different modes. 

We have recently cloned and sequenced one species each of 
Le-IF (Le-IF I)"'" and F-IF cDNA^''". A second species of 
Le-IF (Le-IF 11) cDNA has recently been identified (M. 
Streuli, S.N. and C.W., unpublished results). 

Comparison of the amino acid sequences 
of Le-IF and F-IF 

In Fig. I the nucleotide sequences of Le-IF I and F-IF cDNA- 
were aligned so that the AUGs closest to their 5' termini 
coincided. From the amino-terminal sequence published for F- 
IF'* and lymphoblastoid Le-IF", one can determine that for 
F-IF and Le-IF, respectively, the 21st and 23rd codons 
following the initiation triplet represent the first amino acid of 
the interferon polypeptide. Presumably, the stretch in between 
encodes a signal peptide. As the respective putative signal 
peptides of Le-IF and F-IF comprise 23 and 21 amino acids, 
the IF polypeptides, as aligned in Fig. 1, are shifted by two 
residues relative to their termini. In this alignment, 48 of 166 
positions (29%) of the interferon polypeptides have identical 
amino acids. By introducing appropriate gaps, better 
homology could be achieved, particularly in the region of the 
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signal sequence; in the present comparison this has not been 
done. 

To plot the degree of homology between the F-IF and Le- 
IF as a function of the map distance, the sequence was 
subdivided into segments of 8 amino acids (or 24 nucleotides), 
each overlapping by 4 amino acids (or 12 nucleotides) with the 
neighbouring segments, and the per cent coincidence of amino 
acids (and nucleotides) for each segment was deterfnined (see 
ref. 23). As seen in Fig. 2, amino acid sequences show three 
domains of homology. The first one, with the least degree of 
homology, corresponds to the putative signal sequence, which 
is rich in hydrophobic residues and has 4 identical amino acid 
positions out of 21; the second domain, between amino acids 
28 and 80 (counted on the Le-IF sequence), has 21 identical 
residues out of 51 (41% homology), and the third, between 
positions 115 and 151 (Le-IF sequence), has 19 out of 35 
identical residues (54%). the longest stretches of contiguous 
conserved amino acids are Gln-Phe-Gln-Lys (positions 47-50 
of Le-IF and 49-52 of F-IF) and Cys-Ala-Trp (positions 139- 
141 and 141-143, respectively); the latter sequence is notable 
because it comprises Cys and Trp, which are preferentially 
conserved in related proteins". Table 1 shows that 
conservation was highest between the interferon polypeptides 
(not considering the signal sequences) for Trp, Phe, Arg, Cys 
and Tyr residues, in agreement with the general experience 
that the amino acids most likely to be conserved between 
related proteins are Trp > Cys > Tyr > Arg > Phe, His (ref. 24). 
Even where amino acids are conserved, the codons show one 
or more nucleotide changes in half the instances. The codons 
of three out of seven conserved Leu residues are unrelated, as 
are two of four codons pertaining to conserved Ser residues. 
This suggests that there is a strong selective pressure favouring 
the conservation of several amino acids. It is quite likely that 
at least some of the conserved amino acids are essential for a 
function common to Le-IF and F-IF, perhaps the induction of 
the virus-resistant state in the target cell. These findings may 
provide guidelines for the tailoring of modified^^ possibly , 
shorter polypeptides possessing certain activities of interferon. 

Comparison of the nucleotide sequences 
of Le-IF and F-IF 

The nucleic acid sequences show an average homology of 43 % 
in the domain of the signal sequence and of 45% m the 
interferon polypeptide sequence. On a random b.iM-,^ about 
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G CT CTA GGT TCA GAG TCA CCC ATC TCA GCA AGC CCA GAA GTA TCT GCA ATA TCT ACG ATG GCC TCG CCC TTT GCT TTA CTG ATG GTC CTG GTG GTG CTC ; 
23 



ALA SER PRO PHE ALA 



THR ASN LYS CYS LEU 



LEU MET VAL 



GLN ILE ALA 



VAL VAL LEU 



LEU LEU CYS f 



GTC AAC ATG ACC AAC AAG TGT CTC CTC CAA ATT GCT CTC CTG TTG TGC 1 



S20 1 10 20 

TGC AAG TCA AGC TGC TCT CTG GGC TGT GAT CTC CCT GAG ACC CAC AGC CTG GAT AAC AGG AGG ACC TTG ATG CTC CTG GCA CAA ATG AGC AGA ATC TCT Cci 

LEU GLY CYS ASP LEU PRO GLU THR HIS SER LEU ASP ASN ARG ARG THR LEU MET LEU LEuIaLA GLN MET SER AR6 ILE SER PRC 



CYS LYS SER SER CYS 



SER THR THR ALA LEU 



SER 



HET SER TYR ASN 



LEU GLY PHE LEU GLN ARG SER SER ASN PHE GLN CYS GLN LYS 



LEU TRP GLN LEU ASN GLY ARG L£l 



TCC ACT ACA GCT CTT TCC ATG AGC TAC AAC TTG CTT GGA TTC CTA CAA AGA AGC AGC AAT TTT CAG TGT CAG AAG CTC CTG TGG CAA TTG AAT GGG AGG CTl 



10 



20 



30 m 50 60 

TCC TGT CTG ATG GAC AGA CAT GAC TTT GGA TTT CCC CAG GAG GAG TTT GAT GGC AAC CAG TTC CAG AAG GCT CCA GCC ATC TCT GTC CTC CAT GAG CTG ATC 



SER 


CYS LEU 


MET 


ASP ARG 


HIS 


ASP 


PHE 


GLY PHE 


PRO 


GLN 


GLU 


GLU 


PHE 


ASP 


GLY 


ASN 


GLN PHE GLN LYS 


ALA 


PRO 


ALA 


ILE SER VAL LEU HIS 


GLU 


LEU ILE 


TYR 


CYS LEU 


LYS 


ASP ARG 


l€T 


ASH 
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CCT TTA CAC TGT GGT TAG TGT AAT AAA ACA TGT TCC TTA.'TAT "TTA CTC AAA AAA A f'ig. 1 Comparison of llic nucleotide sequences of 

human leukocyte interferon 1 (Le-IF I) and human 
; fibroblast interferon cDNA and of the derived amino 

- ' ,j f,Tj TTG GAA AAT AAA TTA TIT GT v^aa''a-a GTC AAA A '^"'^ Sequences. The sequences are from Mantei ci al.-° 

• • • • - .jfij j-p ,,;22 They were aligned lo give maximal 

homology, without introducing gaps in the coding 
sequence. -Identical arnino acids are framed, identical 
nucleotides are marked by a dot. S1-S23 indicate the 
amino acids of the putative signal sequence and 1-166 
! the amino acids of the interferon polypeptides. 



Board Assigned Page #1 880 



Nature Vol. 285 19 June 1980 



549 



25% of the nucleotide positions should coincide. Within the 
interferon coding sequence, the nucleotide homologies are 
more evenly distributed than the amino acid homologies. 
However, one may distinguish, albeit to a less pronounced 
degree, the same three blocks of similarity noted for the amino 
acids. The longest region without mismatches extends for 13 
nucleotides (compare 47th to 51st codon of Le-IF with 49th to 
53rd codon of F-IF). There are, in addition, sequences of 17, 
18 and 20 nucleotides with 3, 3 and 4 mismatches, respectively. 
The heteropolymeric 3'-terminal noncoding region of Le-IF 
cDNA has 242 nucleotides, and is longer by 39 residues than 
its counterpart in F-IF cDNA. In aligning the two sequences, 
four gaps were introduced to maximize homology, as described 
by van Ooyen et al.^^. In this way, several segments were 
matched with 29-41 % homology. The introduction of gaps in 
the alignment may be justified in view of the arguments 
presented previously, that intervening sequences and 
noncoding regions of reduplicated genes diverge as a 
consequence of block insertions and/or deletions in the course 
of evolution^^'^''. It is unlikely that the extent of homology 
between Le-IF and F-IF cDNA would allow significant cross- 
hybridization between the two species. 

A common ancestral gene for 
Le-IF and F-IF 

On the basis of our findings,Hhere is no doubt that Le-IF and 
F-IF genes are derived from a common ancestral sequence. 
When did the separation of these genes occur? Human a- and 
/?-globin show 57% amino acid mismatches, and human jS- 
globin and myoglobin, as well as a-gldbin and myoglobin, 
76% mismatches. If the rate of divergence of interferons and 
globins is comparable (however, this is quite uncertain, see ref. 24, 
p. 50, for proteins showing both higher and lower rates), the 
separation of interferon genes occurred after that of myo- 
globin and haemoglobins but before that of a- and ^-globins, 
that is between 500 and 1,000 Myr ago^*, which is about the 
time vertebrates arose^''. This would mean that both types of 
interferon gene should occur in all vertebrates, unless one 
and/or the other was lost by deletion. Indeed, as shown by the 
sequencing of 13-24 amino-terminal residues'^-'"''", mouse 
interferons A and B show significant homology with human 
fibroblast interferon, and mouse interferon C with human 




0 too ' 300 ' 500 ' 700 ' 900 
Base pairs 



Fig. 2 Similarity of the nucleotide and amino acid sequences of 
human leukocyte interferon I and fibroblast interferon. The 
sequences shown in Fig. 1 were subdivided into segments of 8 
amino acids or 24 nucleotides, each overlapping by 4 and 12 
residues, respectively, with the neighbouring segments. The 
percentage of concident residues was plotted as a function of map 
position. Open vertical blocks, nucelotides; filled vertical blocks, 
amino acids. L-IF, leukocyte interferon cDNA; F-IF, fibroblast 
interferon cDNA; lines, noncoding sequences; hatched bars, 
putative signal peptide; open bars, interferon polypeptide. 



lymphoblastoid interferon, whereas the mouse species A and B 
on the one hand, and the species C on the other show no 
significant homology within the short segment sequenced. 
Thus, at least in the mouse, representatives of both interferon 
families exist. It will be of interest to determine the 
evolutionary relationship of these to the third type of 
interferon, immune or y-interferon. 

After submission of this article, we learnt that Derynck et al. 
had cloned and sequenced fibroblast interferon (see 
accompanying article^'), confirming the deduced amino acid 
sequence of T.T et alP. 

T.T. and M.M. thank Dr H. Sugano for his continuous 
support and interest. The work at the University of Ziirich 
was supported by Biogen SA and the Schweizerische 
Nationalfonds. 
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abstract Escherichia coli strain xl776 carrying re- 
combinant DNA plasmids containing cDNA copies of human 
fiory globin mBNAs has been shown by radioimmunoassay to 
synthesize polypeptides antigenically related to the |9 and y 
chains of human hemoglobin. The y and /9 polypeptides have 
been enriched from lysates on immunoabsorbent columns 
containing hemoglobin antibodies and shown to specifically 
inhibit the antigen-antibody binding between '^I-labeled he- 
moglobin and the homologous antibody but not other hemo- 
globin-antihemogldbin reactions. Clone JW15I, which is known 
to contain a complete copy of the coding portion of the y globin 
mBNA, has been shown to produce a protein that reacts spe- 
cifically with antibody to the 7 chain of fetal hemoglobin, he- 
moglobin Kenya, and hemoglobin Bart's. 

Although transcription and translation of cloned eukaryotic 
genes in a prokaryofic system have been well documented (1-7), 
one aspect of molecular cloning that has eluded most efforts is 
the expression of cloned higher eukaryotic genes in a prokar- 
yotic system. We reasoned that part of the difficulty was 
technical, and that a sensitive assay, such as radioimmunoassay, 
would be necessary to detect what might be small amounts of 
protein production. Recently two such experiments have been 
reported. Villa-Komaroff et at. (8) have reported using im- 
munological techniques to detect the production of rat proin- 
sulin cloned in Escherichia coli Xl776, using a cDNA copy of 
rat pre-proinsulin mRNA inserted in bacterial plasmid pBR322. 
Also, Mercereau-Puijalon et at. (9) have reported the synthesis 
of chicken ovalbumin-like protein in E. coli K-12 harboring a 
recombinant DNA plasmid in vector pBR322. More than 2 
years ago, we produced cDNA clones of human a, /3, and y, 
globin mRNA (10, II) inserted into bacterial plasmid pMB9. 
Here we report the expression of proteins from human cDNA 
in E. coli, by demonstrating that these proteins contain anti- 
genic determinant sites specific for human y chain and proteins 
that react with antibodies to human /? chain. This research 
shows that human gene sequences can be expressed in the 
bacterial cell and, furthermore, that bacterial plasmid pMB9 
can be used as a vector for expression as well as pBR322. 

MATERIALS AND METHODS 

Bacterial Strains, Plasmids, and Media. E. coli strain 
xl776, carrying a recombinant DNA plasmid, was grown in 
enriched medium (Luria broth) according to Curtiss et al. (12). 
Bacterial cultures were grown at 37°C to midlogarithmic phase. 
Preparation of plasmid DNA has been described (10, 11). All 
growth experiments were performed in a P3 physical con- 
tainment facility in compliance with National Institues of 
Health guidelines for recombinant DNA research (13). 

Preparation of Crude Lysates. Bacterial cells were harvested 
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by centrifugation at 1600 X g for 10 min at 5°C in an lEC B20A 
centrifuge, resuspended in phosphate-buffered saline con- 
taining Trasylol (Mobay Chemical, New York) at 60 units/ml 
and phenylmethylsulfonyl fluoride (Sigma) at 30 ng/ml, and 
sonicated, in a biological safety cabinet located in the P3 facility, 
with an Artek Sonic 300 dismembrator for 3 min or until 90% 
of bacterial cells were disrupted as measured by the reduction 
in optical density at 590 nm and by appearance in the light 
microscope. Cellular debris was removed by centrifugation at 
500 X g for 10 min, followed by a second centrifugation of the 
supernatant at 10,000 X g for 45 min. The crude lysate was 
concentrated by negative pressure dialysis against 0.1 M sodium 
borate buffer containing 0.075 M NaCl and 0.01 M sodium 
azide, pH 8.3. After concentration, proteins were fractionated 
by ammonium sulfate precipitation by the addition of an equal 
volume of 78% saturated ammonium sulfate. After incubation 
at room temperature fOr 30 min the supernatant was collected 
after centrifugation at 1600 X g for 30 min at 20°C, and the 
precipitate was resuspended in distilled water. Both the pre- 
cipitate and the supernatant fractions were dialyzed twice at 
4°C against 0.01 M ammonium bicarbonate and concentrated 
by lyophilization. The proteins were redissolved in borate 
buffer. 

Isolation and Purification of Hemoglobins. Hemoglobins 
A (02/^2) and A2(a252) were isolated from blood of normal 
donors, Hb F (0272) and Hb Bart's (74) from cord blood, 
Hb F Malta I (0272'" ""^'^'8), Hb G Philadelphia 
(a^es Asn-Lys^2), Hb S (02/^2^ Giu-Vai)^ ^^d Hb Kenya from 
individuak known to carry these variants. Hb Kenya has a fused 
non-a Hb chain consisting of an NH2-terminal corresponding 
to the 7 chain (to approximately amino acid 86) and the re- 
maining COOH-terminal portion identical to the ^ chain. 
Erythrocyte lysates were prepared and hemoglobins were iso- 
lated by DEA&cellulose chromatography as described (14, 15). 
The dilute hemoglobin fractions were concentrated by ultra- 
filtration with an Amicon ultrafiltration cell (PMIO Membrane, 
Amicon, Lexington, MA). Homogeneity of the preparations was 
analyzed by starch gel electrophoresis (15). 

Preparation and Absorption of Antisera. New Zealand 
White rabbits were immunized with the purified hemoglobins 
according to the procedure of Carver et al. (16). 

Crossreacting antibodies were eliminated by immunoad- 
sorption with hemoglobin coupled to AH-Sepharose as reported 
(17, 18). The specificity of the absorbed antisera was analyzed 
by measuring the precipitation of both the homologous ^^I- 
labeled hemoglobin and the '^I-labeled hemoglobin that was 
used for absorption. The antisera were further characterized 
by inhibition tests using unlabeled homologous and heterologous 
hemoglobin antigens. 

The preparation and specificity of antisera utilized for ra- 
dioimmunoassay is listed below and described in detail in the 
accompanying references: (a) anti-Hb F serum absorbed with 
Hb A (7 chain specific) (16); (b) anti-Hb A2 serum absorbed 
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with Hb A (6 chain specific) (16); (c) anti-Hb F Malta I serum 
absorbed with Hb F (specific for Arg at position 117 of the y 
chain) (19); (d) anti-Hb G Philadelphia serum absorbed with 
Hb A (specific for Lys at position 68 of the a chain) (17); {e) 
anti-Hb Kenya serum absorbed with Hb A (specific for the 
region of the non-a chain of Hb Kenya) (18); (J) anti-Hb Bart's 
absorbed with Hb A (specific for y chain) (unpublished data); 
and (g) anti-Hb S absorbed with Hb A (specific for Val at po- 
sition 6 of the /3 chain) (17). 

Preparation of Labeled Antigens and Radioimmunoassay. 
Hemoglobins were labeled with by the chloramine-T 
procedure (20). Unbound '^I was eliminated by gel filtration 
and dialysis (16), and the preparations were stored at 4°C. 

A standard radioimmunochemical inhibition test (16, 20) was 
used to determine the presence of immunoactive proteins from 
crude lysates of bacterial cultures. In these tests, proteins ob- 
tained from bacterial cultures were assayed for their ability to 
inhibit an ^^I-labeled hemoglobin-antihemoglobin precipi- 
tation reaction. 

Isolation of Clobin-Chain-Like Proteins. Antigenically 
active protein was isolated by affinity chromatography. The 
ariimonium sulfate supernatants of lysates were chromato- 
graphed on antibody-coupled Sepharose columns. The antibody 
had been isolated from the antisera by chromatography on 
hemoglobin-conjugated Sepharose columns, and the retained 
antibody had been dissociated with 0.05 M glycincHCl buffer, 
pH 3.5. After preparation of the antibody affinity column, the 
colunms were washed extensively with 3 M sodium thiocyanate, 
borate buffer, then glycine-HCl buffer. After washing, the 
columns were reequilibrated with borate until the absorbance 
at 280 nm was less than 0.01 prior to the addition of ammonium 
sulfate supernatant of the bacterial cell lysate. Bound protein 
was eluted from the columns by the addition of glycine buffer, 
collected, concentrated, and stored at 4°C. 

Gel Electrophoresis and Restriction Endonuclease Di- 
gestion. Procedures for restriction endonuclease digestion and 
gel electrophoresis have been described (10, 11). All restriction 
enzymes were purchased from New England Bio-Labs or Be- 
thesda Research Laboratories, Rockville, MD. 

RESULTS 

Screening of Clones. Previously we reported (11) the pro- 
duction of 54 hybrid DNA clones containing sequences com- 
plementary to human, a, /3, or y chain globin mRNA. To fa- 
cilitate the determination of clones as candidates for the pro- 
duction of immunoactive globin proteins, crude lysates of these 
clones were prepared from 30-ml cultures. Supernatants ob- 
tained from each of these clones were tested by radioimmu- 
noassay for the inhibition of the precipitation of '^I-labeled 
hemoglobin by antibody. Clones that were previously deter- 



mined (11) to contain sequences complementary to y globin 
mRNA were tested in a radioimmunochemical assay against 
Hb F, whereas clones containing sequences complementary to 
jS globin mRNA were tested in an assay against Hb A. Crude 
lysates from the majority of clones tested gave no inhibition, 
and neither did the control of E. coli carrying pMB9 plasmid 
(Table 1). Cultures showing less than 5% inhibition were scored 
as negative. Two clones, JW151 and JW109, showed slight in- 
hibition with i25Mabeled Hb F (Table 1, line 1) and i25i.labeled 
Hb A (Table 1, line 3) respectively. These results indicated that 
these clones would be likely candidates for further study. Be- 
cause such a low level of inhibition probably represents a low 
level of immunoactive protein present, experiments were 
conducted for enrichment of such proteins. 

Protein Enrichment. Proteins were fractionated from cul- 
tures of JW151 and JW109 by anunonium sulfate precipitation. 
By this procedure, 70% of the total bacterial protein was pre- 
cipitated. The ammonium sulfate precipitate and supernatant 
were tested by radioimmunoassay for inhibition. Because 70% 
of the antigenic activity was found in the supernatant, this 
fraction was used for further pruification. 

Additional purification of antigenically actice proteins was 
obtained by the use of affinity chromatography with antibody 
coupled to Sepharose. Protein from JW 151 ammonium sulfate 
supernatant (63 mg) waschromatographed on such a column, 
as illustrated in Fig. 1. This procedure yielded 7% of the total 
protein (4.4 mg) being eluted by glycine-HCl buffer, after 
having been bound by Hb F antibody. This represents 2% of 
the protein recovered from the crude lysate. Similarly, 1.5 mg 
of protein was eluted from JW109 lysate by using a Sepha- 
rose-bound anti-Hb A column. Eluted proteins were further 
tested by radioimmunoassay for inhibition. In such tests, 50 fig 
of isolated protein from JW151 gave 82.8% inhibition (Table 
1, line 2) as compared to 97^3% inhibition with O.Sng of purified 
Hb F. Bound protein (50 fig) from JW109 yielded 75% inhi- 
bition (Table 1, line 4), whereas 5 /tg of Hb A gave 91.3% in- 
hibition. Lysates of bacterial cultures carrying pMB9 plasmid 
showed that no immunologically active proteins were retained 
on the Sepharose-antibody columns (data not shown). 

Fig. 2 represents the inhibition curve obtained by radioim- 
munoassay for enriched globin specific proteins from JW151 
at various concentrations. As can be seen, proteins isolated from 
this clone showed a significant amount of inhibition. However, 
maximum inhibition obtained for protein from JW151 was 82% 
(at inputs of greater than 100 ng of protein), whereas Hb F 
controls yielded 100% inhibition with as little as 5 /ig of Hb F. 
This implies that the enriched protein from JW151 has reduced 
antigenic activity (on a weight basis) compared to Hb F controls 
but still exhibits a substantial degree of y chain antigenic 
specificity as demonstrated below. 



Table 1. Radioimmunoassay for globin chains 

Controls E. co(i carrying Clone JW151 Clone JW109 

Normal pMB9 plasmid. Crude Isolated Crude Isolated 

Antiserum serum Hb F Hb A crude lysate lysate protein lysate protein 

(AS) (NS) (5.0 Mg) (5.0 Mg) (100 Ml) (100 mD (50 Mg) (100 mD (50 Mg) 



7 chain 

1. 0% (4185) 100% (548) 90.1% (909) 2.3% 11.1% (3781) 

2. 0% (2776) 100% (580) 97.3% (645) 82.8% (957) 
j8 chain 

3. 0% (2193) 100% (397) 82.4% (714) 2.3% 11.7% (1983) 

4. 0% (2800) 100% (469) 91.3% (672) 75.0% (1050) 



Values represent percent inhibition of the precipitation of i^I-labeled hemoglobin by antibody; the actual cpm is given in parentheses. Antiserum 
(AS) control values represent maximum precipitation in absence of inhibitor. Normal serum (NS) control values (no antibody) represent nonspeciHc 
precipitation and therefore reflect expected values for 100% inhibition. Percent inhibition (%I) was calculated by the formula %I = 100 X [(AS 
— precipitated)/(AS — NS)]. Crude lysates of hybrid clones and pMB9 contained approximately equal protein concentrations (80 mg/ml). 
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Fig. 1. Protein fractionation on antibody-coupled Sepharose 
columns. Protein from the ammonium sulfate supernatant of the 
crude lysate from JW151 was applied to the Sepharose column 
equilibrated in borate buffer (arrow on left), and the column was 
washed until the A280 was less than 0.01. Bound protein (fractions 
50-60) was eluted from the column by the application of glycine-HCl 
buffer, pH 3.5. 

Analysis of Specificity of Clobin Protein. In order to de- 
termine if enriched protein from JW151 nonspecifically in- 
hibited the antigen-antibody interaction, Sepharose-isolated 
protein was analyzed in radioimmunoassays with various he- 
moglobin-specific antibodies. Table 2 illustrates the results of 
these tests. The 7 globin-like protein from JW151 gave no in- 
hibition in radioimmunoassays specific for normal 5 chains (Hb 
A2), in an assay specific for a 7 chain variant (Hb F Malta I), 
or an a chain variant (Hb G Philadelphia), but gave inhibition 
in assays for 7 chains (Hb F, Hb Bart's). In addition, the purified 
JW151 protein partially blocked the binding between '^I Hb 
Kenya and antibody to Hb Kenya, providing further evidence 
of 7 chain antigenic determinants. 

The (8 globin-like protein isolated from JW109 lysates failed 
to show any inhibition in the radioimmunoassay specific for 7 
chains. This protein also failed to show any inhibition in the 
radioimmunoassay for Hb S. 

Determination of the Direction of Clobin Gene Insertion 
in pMB9. The orientation of globin complementary sequences 
inserted within a bacterial plasmid could be of importance for 
the expression of these sequences within the cell, because their 
expression will depend on the plasmid and bacterial tran- 
scriptional mechanisms. For this reason pMB9 and various re- 
combinant DNA plasmids were digested with restriction en- 
donucleases to determine the orientation of the inserted globin 
cDNAs within the plasmids. Fig. 3 shows a detailed restriction 
endonuclease cleavage map of the a, jS, and 7 globin cDNA as 
determined from nucleotide base sequence analysis of recom- 
binant DNA plasmids (unpublished data). As can be seen from 




Fig. 2. Inhibition assay for protein obtained from Sepharose 
column. Protein obtained from JW151 was assayed, at various con^ 
centrations, for its ability to inhibit the immunochemical reaction of 
1251-labeled 7 globin (0.1 ng) and anti-7 globin. 

Fig. 3, (8 and "^7 globin cDNA contains no Hha I enzyme 
cleavage site. Furthermore, both /3 globin and *'7 globin cDNAs 
contain a single £co RI and Bam HI enzyme cleavage site. The 
globin cDNAs were inserted into pMB9 at the Eco RI cleavage 



Table 2. Inhibition of antigen-antibody reactions 

% inhibition 



JW151 
bound 



Reactants 


peak 
(25 Mg) 


Control 
(5;ug) 


Antigen 


Antibody 


126I-Hb A2 


Anti-Hb A2 


0 


97.0 (Hb A2) 


125I.Hb F 


Anti-Hb F 


67.9 


97.3 (Hb F) 




(y chain specific) 






i25i-Hb F 


Anti-Hb F Malta I 


0 


76.7 (Hi) F 


Malta I 


(yin Arg specific) 




Malta I) 


i25I-Hb Kenya 


Anti-Hb Kenya 


44.2 


100 (Hb Kenya) 




(specific for 7 








portion of non-a 








chain of 








Hb Kenya) 






125I.Hb G 


Anti-Hb G 


0 


93 (Hb G 


Philadelphia 


Philadelphia 




Philadelphia) 




(specific for ^5") 






i25i.Hb Bart's 


Anti-Hb Bart's 


32 


53 (Hb Bart's) 



Test of chain specificity of protein obtained from clone JW151. 
Values are percent inhibition of antibody precipitation of ^^I-labeled 
hemoglobin (0.1 fig) by Sepharose-isolated protein from JW151. 
Controls consisted of identical reactions in which unlabeled hemo- 
globin, specific for the antibody, was added instead of protein from 
JW151. 
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Fig. 3. Restriction endonuclease cleavage maps for the a, ft and ^7 globin genes, as determined from base sequence analysis of recombinant 
DNA plasmids (data not shown). 



site (10, 11). This site is flanked by two Hha I cleavage sites (21), 
one approximately 900 bases to the left of the EcoRI site and 
the other approximately 400 bases to the right. Previous studies 
have shown that this Hha I restriction fragment of pMB9 (Fig. 
4) is the largest fragment obtained by digestion of pMB9 DNA 
with Hha I (11) and contains all of the inserted (8 and 7 globin 
cDNAs. Therefore, for simplicity, the remaining restriction 
fragments produced by digestion of pMB9 by Hha I will hot 
be considered in this report. Digestion of /3 and ^7 globin cDNA 
plasmids with Hha I and EcoRI enzymes shows this fragment 
to be cut into two fragments of unequal size. Further digestion 
with enzyme Bam HI determines the direction of insertion of 
the globin cDNA. If the largest Hha I EcoRI fragment is cut 
by Bom HI, then the Bam HI site must be associated with the 
restriction fragment that contains 900 bases complementary 
to pMB9. Therefore, the cDNA would be inserted 5' to 3' with 
respect to the globin mRNA and with a clockwise orientation 
from the RI site to the tetracycline site in the plasmid DNA. If 
the smallest fragment is cut by Bam HI, the cDNA would be in 
a 3'-to-5' orientation. 




— pMB9 Hha I 

" y cDNA 

A'T-tailed regions 

Fig. 4. Direction in which the globin complementary sequences 
in JW151 have been inserted into bacterial plasmid pMB9. 



Fig. 4 shows that plasmid JW151 has been determined to 
have the 7 globin cDNA inserted in a 5'-to-3' direction. Re- 
striction endonuclease digestion of |8 globin plasmids JW109 
and JW102 has shown that JW109 plasmid has the globin 
sequences inserted into pMB9 in the same orientation as the 7 
globin sequences in JW151 and that JW102 (which has shown 
no antigenically active protein production) is in the opposite 
orientation. 

DISCUSSION 

Garver et al. (16) have reported the preparation of monospecific 
antibodies for the 6 and 7 chains of human hemoglobin and the 
incorporation of these chain-specific antisera into radioim- 
munoassays to quantitate hemoglobins A2 and F. These workers 
have also described the methods for preparing antibodies that 
specifically recognize the single amino acid substitution in 15 
variant human hemoglobins (17). Furthermore, Reichlin et al. 
(22) have shown that antibodies produced against native he- 
moglobin react with free a and /3 chains, and antibodies against 
the individual native subunits react with native hemoglobin. 
Therefore, antibodies produced against native hemoglobin 
should recognize antigenic sites on free chains synthesized in 
£. coli. By use of such antibodies, we have been able to dem- 
onstrate proteins, in our recombinant DNA clones of E. coli, 
showing antigenic activity. The protein production of JW151 
has been determined to be 7 specific by use of various hemo- 
globin antigen-antibody systems. Because protein isolated from 
JW151 showed no interaction with antibody to the 5 chain of 
Hb A2 but did show reactivity with antibody to the 7 chain of 
both Hb F and Hb Bart's, the protein produced must be 7 
specific and not a or 5 chain specific. Furthermore, because the 
protein exhibited close to 45% inhibition with Hb Kenya (55% 
7, 45% j8), then this protein probably contains most of the an- 
tigenic determinant sites associated with the NH2-terminal end 
of the 7 globin chain. As expected, this protein also failed to 
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show any inhibition with an antibody against Hb F Malta I that 
specifically recognizes the His Arg substitution at amino acid 
residue 117 in the y chain of Hb F Malta I. Nor did the protein 
inhibit an antibody against Hb G Philadelphia that is specific 
for the Asn — ► Lys substitution at position 68 of the a chain. 

Through the use of radioimmunoassays we were able to ob- 
tain no more than 85% inhibition with 7-like (Fig. 2) and 75% 
inhibition with ;8-like protein, preparations (data not shown). 
This may be indicative of a modified protein that lacks anti- 
genic determinant sites found on normal 7 or j3 chains. This 
may be attributed to an altered conformation of the protein due 
to the lack of globin specific sequences or to the transcription 
of excess plasmid DNA. DNA sequencing studies of the insert 
in JW151 have shown this hybrid plasmid to contain sequences 
complementary to the human y globin mRNA from preceding 
the initiator codon to sequences in the 3' untranslated region 
following the termination codon (unpublished data). With a 
complete copy of the translated region of the y mRNA present, 
one would expect any globin-like protein produced to be 
complementary to the y globin chain. Therefore, the decreased 
antigenicity may be due to the production of a hybrid protein 
that consists of both bacterial and globin sequences. This can 
be envisioned if the bacterial promoter for the transcription of 
this region of pMB9 plasmid initiates transcription upstream 
from the site of the insert, as is the case for the rat proinsulin 
system (8). The molecular weight of such proteins will not be 
that of purified 7 or j8 globin chains, because it will also be 
dependent on the extent of plasmid sequences that are co- 
transcribed with the inserted globin sequences. 

The ability of cloned eukaryotic genes to be expressed in a 
prokaryotic system is probably governed by a multitude of 
factors. One such factor may be the orientation of the inserted 
gene with respect to the direction of transcription of the plasmid 
DNA. In our studies, it has been determined that both recom- 
binant plasmids that show expression also have the cDNA in- 
serted into the plasmids in the same orientation. This may be 
indicative that this orientation is necessary for proper strand 
specificity during RNA transcription and may therefore further 
support the idea that the eukaryotic gene is using the prokar- 
yotic mechanisms for transcription. 

Note: Since the submission of this paper, Goeddel et al. (23) 
have reported the expression of synthetic human insulin genes 
cloned in E. coli by using plasmid pBR322. 

This work was supported in part by National Institutes of Health 
Grants HLB 15158 and HLB 23294. 
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ABSTRACT Flasmids are constructed by using recombi- 
nation in vitro according to Roberts, T. M., Kacich, R. & Ptashne, 
M. (1979) Proc. Nat/. Acad. Sci. USA 76, 760-764 in which the 
t antigen gene of simian virus 40 is fused to a promoter of the 
Escherichia coli lac operon. In the fusions, transcription com- 
mences at the lac promoter, and, in some of the fusions, trans- 
lation begins at the ATC initiator codon of the t gene. This 
translation is directed most efficiently by those plasmids in 
which the lac sequences abut the ( gene such that a hybrid 
ribosome binding is encoded. In this case, the Shine-Dalgamo 
sequence is of 7ac origin but the ATG derives from the ( gene. 
Translation from this initiator codon is greatly decreased if the 
lac sequences are separated from the ATC by 17 base pairs and 
is abolished if the AT of this triplet is deleted. Cells bearing the 
productive fusions synthesize a 20,000-dalton protein with t 
antigenic determinants. This protein has an isoelectric point(s) 
indistinguishable from that of t antigen isolated from simian 
virus 40-transformed cells. Moreover, a partial sequence of the 
amino-terminal region of the bacterial product is that predicted 
for authentic t antigen. We conclude that these bacteria are 
producing a protein, the sequence of which is identical to that 
of authentic t antigen unfused to other polypeptides. 

The T antigens of simian virus 40 (SV40) are virus-encoded 
proteins that play a central role in the process of virus-induced 
neoplastic transformation (1-8). One of these proteins, the t 
antigen (t), is a 174-amino acid polypeptide whose coding se- 
quence is known. Unlike the related protein SV40 T antigen 
(T), it is cytoplasmic and does not bind to DNA in vitro (ref. 1; 
J. Griffin and D. Livingston, unpublished results). Analysis of 
t function in vitro has been limited by the fact that it is difficult 
to purify from SV40-infected cells. The DNA encoding t has 
no intervening sequences and, if supplied the appropriate cis- 
acting control sequences, its expression in a bacterium should 
produce a polypeptide identical in sequence to t. 

In this paper, we describe the construction of plasmids that 
direct the synthesis of apparently authentic SV40 1 in bacterial 
cells. The essential feature of the construction is the fusion of 
the t gene to an Escherichia coli DNA fragment bearing a 
promoter and a so-called Shine-Dalgamo (SD) sequence (9-11), 
both from the lac operon. The former insures efficient tran- 
scription of the t gene. The latter, when positioned at the ap- 
propriate distance from the ATG signaling the translational 
start, directs binding of the mRNA to ribosomes and proper 
initiation of translation. We refer to such a regulatory sequence 
as a "hybrid ribosome binding site". The product of translation 
is not a bacterial-eukaryotic fusion protein. Rather, our ex- 
periments strongly suggest that, as expected, it has the primary 
structure of t. The approach we use to position the DNA frag- 
ment containing the bacterial control sequences is based on the 
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principle and methods utilized to maximize the production of 
two regulatory proteins encoded by phage X, repressor and cro 
protein (12, 13). 

MATERIALS AND METHODS 

Animal Cells and Viruses. The SV80 strain of SV40-trans- 
formed human fibroblasts (14) served as a source of authentic 
t. SV40, strain 777, was grown and its DNA was isolated as de- 
scribed (15). Protein was labeled in vivo with ['^SJmethionine 
(specific activity, 400-600 Ci/mmol; 1 Ci = 3.7 X lO^o bec- 
querels) at a concentration of 50-100 ^Ci/ml as described 
(16). 

Bacterial Cells and Radiolabeling of Protein. £. coli strains 
MM294 {endo r, B", nT, m]+) and RB113-(recA13, uvrA6, 
thi-1, thrl, leuB6, proA2, argES, his-4, mtl-1, xyl-5, tsx33, 
strASl, gal2, aral4, lacYl ) were used. Labeling of total cell 
protein was carried out in MM294 by using Na'^S04 under the 
condition of Roberts and Roberts (17). Plasmid-coded proteins 
were labeled in RBI 13 by using [^]methionine under the 
conditions of Sancar et al. (18) as modified by R. Brent (un- 
published), the 'maxicell technique'. This technique specifically 
labels with radioisotope plasmid-coded protein. A typical ex- 
periment is performed by hghtly U V irradiating recA ~ uvrA ~ 
cells bearing the plasmid of interest. The cells are then incu- 
bated in nutrient medium for 3 hr at which time cycloserine 
is added to 100 /tg/ml. The incubation is then continued for 
10-14 hr during which time most of the host chromosomal (but 
not plasmid) DNA is destroyed. Finally, the cells are washed 
in sulfur-free medium and incubated in the presence of ['^S]- 
methionine for another 2 hr. For details, see Sancar et al. 
(18). 

Plasmid Construction and Analysis. Enzymes and tech- 
niques for plasmid construction and analysis have been reported 
(12, 13, 19). All plasmids were constructed and propagated 
under conditions conforming to the standards outlined in the 
National Institutes of Health Guidehnes for Recombinant DNA 
Research. 

T Antigen Immunoprecipitation and Gel Electrophoresis. 

Total cell protein from SV80 cells was labeled with C^Slme- 
thionine, extracted, and immunoprecipitated either with serum 
from hamsters bearing SV40-induced tumors or with serum 
from tumor-free animals as described (16). For the immu- 
noprecipitation of t, polypeptides from E. coli extracts were 
prepared and immunoprecipitated in the presence of an excess 
of unlabeled E. coli 294 extract as described by Roberts and 
Roberts (17). Sodium dodecyl sulfate/polyacrylamide gel 
electrophoresis was as described by Laemmli (20). After fixa- 
tion, staining, and destaining (where indicated), gels were 



Abbreviations: SV40, simian virus 40; kd, kilodaltons; SD, Shine- 
Dalgarno; t, t antigen; T, T antigen. 
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Fig. 1. Schematic representation of plasmid construction. The approximate location of several restriction endonuclease cleavage sites 
are shown for a large fragment of the plasmid pBR322, for an S V40 DNA fragment bearing the t gene, and for a DNA fragment bearing the promoter 
(Pi„c) and SD sequence of the lacZ gene (SDi). [See Backman and Ptashne (12) for the source of this fragment.] The position of the origin of 
replication (on) and |8-lactamase gene of pBR322 (AMP) are indicated. ATG and TAA are the start and stop signals for translation of t. Con- 
struction steps were as foUows: (i) the t gene was cloned in pBR322. SV40 DNA was cleaved with Hpa II and the resulting sticky ends were rendered 
flush with DNA polymerase I and the four deoxynucleotide triphosphates (these "filled-in" restriction sites are designated with parentheses). 
Subsequent cleavage with Bam generated a 3(XX)-base pair fragment carrying the f gene. This fragment was joined to a pBR322 backbone generated 
by R\ cleavage, treatment with polymerase I to fill in the R\ end, and finally Bam cleavage. In the resulting plasmid, pTR405, the filled-in R\ 
site of pBR322 has been fused to the filled-in Hpa II site on SV40 to regenerate an R\ site. (I'i) A portion of pTR405 containing two unwanted 
Hindlll sites was deleted: pTR405 DNA was digested to completion with i4cc I and partially digested with Hindlll. The resulting sticky ends 
were rendered flush and the various digestion products were separated by agarose gel electrophoresis. The correct partial digestion product 
(the ends of which are designated by long arrows inside the pTR405 circle) was isolated from the gel, recircularized with T4 DNA ligase, and 
used to transform E. coli. In the resulting plasmid, pTR410, the first ifindlll site downstream from the t gene has been fused to the Acc I site 
near the origin of pBR322 replication. Thus, pTR410 has only one HindlU site located just upstream from the translational start of t. In 6 , plasmid 
pTR410 and derivatives of it are represented linearly. {Hi) pTR410 was opened with HindlU, and varying amounts of DNA were removed with 
Exo III and SI nuclease, (iv) The resected plasmids were cut with Ri, and the DNA fragment bearing the promoter and SD sequence of the lac 
operon was inserted. The product is a series of plasmids bearing the lac promoter and its associated SD sequence at various distances from the 
SV40 1 sequence. 



usually dried and autoradiographed on Kodak XR-1 film. On 
occasion, gels were impregnated with 2,5-diphenyIoxazole and 
fluorographed by the method of Bonner and Laskey (21). 

Two-Dimensional Gel Electrophoresis. Two-dimensional 
gel electrophoresis was performed by the nonequilibrium 
technique, essentially as described by Crawford and O'Farrell 
(22). A 12.5% polyacrylamide slab running gel was utilized in 
the second dimension. Gels were prepared for f luorography, 
dried, and exposed to x-ray film as described above. 

Automated Amino Acid Sequence Analysis. The 20 kil- 
odalton (kd) protein was labeled with ['^Jmethionine in cells 
carrying pTR436 by using the "maxicell" technique (18). It was 
additionally purified by immunoprecipitation and sodium 
dodecyl sulfate gel electrophoresis. The material was eluted 



from the gel, and 50,000 cpm were analyzed in a Beckman 
890B Automated Sequenator by using a 0.1 M Quadrol program 
(23). The radioactivity produced at each cycle was measured 
in a liquid scintillation counter. Unlabeled apomyoglobin (700 
Hg) was added to the initial sample as an internal standard. 

RESULTS 

Plasmid Construction, Screening, and Initial Character^ 
ization. Fig. 1 shows the method used to construct plasmids that 
direct synthesis of t in bacteria. The essential steps were: (») a 
SV40 DNA fragment containing the t gene was cloned in 
pBR322 to produce pTR405. (ii) To delete unwanted restriction 
sites, pTR405 was reduced in size by restriction and ligation to 
produce pTR410. The important features of this plasmid are 
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mRNA SD/ac 

Iaattgtgagcgataacaatttcacac 
ttaacactcgctattgttaaa6tgt6 



Met AspLysVa I 



AG6A AACAG AGCTTTGCAAAG ATG 6ATAAA6TT 
TCCTt TTGTClTCGAAACGTTTCrrAClCTATTTCAA 

pTR440t 



pTR436 



pTR431 



Fig. 2. Sequences around beginning of t gene in a series of fusion plasmids. Shown is a portion of the sequence of pTR422 extending from 
the startpoint of transcription of the lac promoter on the left (straight arrow) to the startpoint of translation of the t gene on the right. Also 
shown is the SD sequence of the lac operon. All plasmids produced by the Exo III/Si method may be considered deletions of pTR422. The extent 
of these deletions in three of the plasmids is indicated by brackets. 



a unique Hindlll site just (14 base pairs) upstream from the 
start point of translation of the t gene and a unique EcoRl site 
further (400 base pairs) upstream, (in) pTR410 DNA was 
cleaved with Hindlll and shortened to varying extents with 
exonuclease III and SI nuclease. In this way, varying amounts 
of the DNA between the Hindlll site and the start-point of 
translation of t were removed. (it>) The resected plasmid was 
cut with EcoRl, and a DNA fragment bearing the prokaryotic 
control signals was inserted into the plasmid backbone. Because 
this fragment was bounded by EcoRl and Alu ends, its direc- 
tion of insertion was fixed. 

Plasmids constructed as described in Fig. 1 were used to 




transform E. coli, and ampicillin-resistant clones were screened 
for production of t by immune precipitation. Approximately 
half the clones produced a protein of approximately 20 kd 
which was specifically precipitated by anti-T sera. The nucle- 
otide sequence of about 80 base pairs spanning the fusion of 
bacterial and t gene sequences was determined for plasmids 
isolated from several of the clones, both positive and negative 
for immunoprecipitable 20-kd protein. 

In a separate experiment, the enzymatic resection step was 
omitted, and the sticky ends produced by Hindlll digestion of 
pTR410 DNA were rendered flush with DNA polymerase I and 
the four deoxyribonucleotide triphosphates before insertion of 
the fragment bearing the lac control sequences. The resulting 
plasmid was pTR422. 

Fig. 2 shows sequences from three of the resulting plasmids 
as well as the unresected pTR422. In pTR440 and pTR436, the 
distances separating the SD sequence of the lac operon are 11 
and 9 base pairs, respectively. In pTR431, the first two bases 
of the small t gene ATG were removed. 



Fig. 3. Immunoprecipitation of extracts of [^S]methionine- 
labeled SV80 and MM294 (pTR42l36) cells. Immunoprecipitation 
was performed on two identical aliquots of each labeled cell extract 
in parallel as described (16, 17). In one instance, 10 fil of hamster 
anti-T serum was employed, and, in the other case, 10 iil of control 
hamster serum was added. Immunoprecipitates were dissolved and 
electrophoresed in 12% sodium dodecyl sulfate/polyacrylamide gels 
with 4% polyacrylamide stacking gels, and the gels were autoradio- 
graphed. Lanes: A, control serum precipitate of MM294 (pTR436) 
cell extract; B, anti-T serum precipitate of the same extract; C, anti-T 
serum precipitate of SV80 cell extract; D, control serum precipitate 
of the same extract. 




Fig. 4. Synthesis of t directed by various fusion plasmids. The 
maxicell technique was used to specifically label protein whose syn- 
thesis is directed by four fusion plasmids. After the cells were labeled, 
they were disrupted and the contents were examined directly by 
polyacrylamide gel electrophoresis and autoradiography. The position 
of a 29-kd protein (presumably ;3-lactamase) and of a 20-kd species 
(presumably t) are indicated. The plasmids used for the experiments 
are as follows; lanes: A, pTR422; B, pTR440; C, pTR431; D, pTR 436. 
Approximately 4 X 10^ cpm of each extract was applied to the gel. 
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by the maxicell technique of Sancar et al. (18). The experiment 
of Fig. 4 utilized this technique to show that pTR436 and 
pTR440 each direct the synthesis of two proteins. One was of 
the size expected for jS-lactamase (29 kd) and the other was the 
same size as t (20 kd). This latter species was also specifically 
precipitated by anti-T serum (not shown). Both of these plas- 
mids produced comparable amounts of the 29-kd protein, but 
pTR436 directed synthesis of almost twice as much of the 20-kd 
species as did pTR440. In the former, the SD sequence was 9 
base pairs from the AUG of t, and, in the latter, it was 11 base 
pairs. In striking contrast are results with strains bearing 
pTR431 and pTR422. In both cases, jS-lactamase was produced 
but synthesis of the 20-kd protein was absent in the former and 
barely detectable in the latter. These results show that deletion 
of the A and T of the initiating ATG of t aboUshes synthesis of 
the protein and that positioning of the lac SD sequence is cru- 
cial. Fig. 4 also shows that smaller species (<10 kd) are also 
produced by our plasmids. In those cases (lanes B and D) in 
which t was produced, we suspect that at least some of these 
smaller species are breakdown products of that protein. In the 
cases (lanes A and C) in which little or no t was synthesized, a 
relatively predominant species of about 10 kd appeared, which 
we suspect is initiated at an internal ATG in the T gene. 

Comparative Two-Dimensional Gel Electrophoresis of t 
Antigen and the E. colt 20-kd Anti-T-Reactive Polypeptide. 
As assayed by two-dimensional gel electrophoresis, authentic 
t was indistinguishable from the presumptive t gene product 
made in £. coh (Fig. 5). Both formed streaks during isoelectric 
focusing and subsequently migrated as a>24-kd species in the 
second dimension. Crawford and O'Farrell (22) have shown 
that the apparent molecular weight of alkylated t is approxi- 
mately 24 kd. Protein streaking is apparently not an artifact of 
the gel system because neither tubulin nor actin, also observed 
in these immune precipitates, streaked. These two proteins have 
been found to be nonspecifically precipitated by such sera 
under these conditions (22). Control serum precipitates of the 
SV80 extract revealed only tubulin and actin spots whereas 
those of pTR436 revealed no spots (data not shown). 
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Fig. 5. Two-dimensional gel electrophoresis of bacterial and 
mammalian T antigens. T and t, labeled with P*S] methionine, were 
isolated from an extract of SV80 cells by immunoprecipitation. The 
20-kd protein was prepared from RB113 (pTR436) labeled with 
[^^SJmethionine by the maxicell technique. It was purified further 
by immunoprecipitation. Proteins were eluted, reduced, and alkylated 
with N-ethylmaleimide and subjected to nonequilibrium, two-di- 
mensional gel electrophoresis (22). Separation in the vertical di- 
mension is by size and in the horizontal, by charge. (A) SV80 immu- 
noprecipitate; 70,000 cpm was applied. (B) RB113 (pTR436) immu- 
noprecipitate; 20,000 cpm was applied. (C) Mixture of SV80 immu- 
noprecipitate (30,000 cpm) and RB113 (pTR436) immunoprecipitate 
(10,000 cpm). The gels were fluorographed and exposed for 7 days. 
One possible source of the t streaking is heterogeneous alkylation. The 
arrow denotes the 24-kd protein. The two prominent species in the 
left part of A and C are tubulin (56 kd) and actin (43 kd). 

Characterization of Plasmid-Encoded Protein. Fig. 3 shows 
that treatment of ^S-labeled total cell protein from clone 
MM294 (pTR436) with anti-T serum precipitates a protein of 
the same mobility in a sodium dodecyl sulfate/polyacrylamide 
gel as t (20 kd). The figure also shows that control hamster 
serum fails to precipitate this protein. 
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Sequencer cycle 

Fig. 6. Location of methionines in the amino-terminal segment 
of bacterial t antigen. The 20-kd protein labeled with [^^Slmethionine 
(50,000 cpm) was analyzed by automated Edman degradation. The 
product of each cycle of degradation was quantitated in a liquid 
scintillation counter. 
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Partial Amino-Terminal Sequence of the 20-kd Protein 
Product. Fig. 6 shows that, of the first 36 residues of the 20-kd 
protein isolated from bacteria, methionine is present only at 
positions 1, 14, and 30. This is known to be the case for authentic 
t (24), The analysis was performed by automated Edman 
degradation of bacterial t labeled with [^]methionine. As is 
often the case in such analyses, the yield of radioactive amino 
acid was 25-40% of the amount applied. Thus this result, in 
addition to partially confirming the expected sequence of the 
bacterial product, suggests that a significant portion, if not all, 
of the molecules have an unblocked amino terminus. 

DISCUSSION 

Our plasmid-bearing cells produce authentic t by the following 
criteria: (i) a plasmid-encoded 20-kd protein is specifically 
immunoprecipitated with anti-T serum; (ii) the protein has 
precisely the mobiUty of t in both one- and two-dimensional 
gels; {Hi) the position of methionine residues (1, 14, and 30) at 
the amino terminus of the protein is that of t; (iv) synthesis is 
abolished if the first two bases of the t initiation codon are de- 
leted, and synthesis is barely detectable if the distance between 
the lac SD sequence and the ATG of t is large (17 base pairs). 
The latter result also supports the notion that formation of a 
hybrid ribosome-binding site bearing appropriately positioned 
SD and ATG sequences is essential to translation of t. We as- 
sume, without explicit proof, that translation of t in our bacteria 
stops at the first in-phase termination codon. This UAA, which 
immediately follows the codon for amino acid 174, is presumed 
to be the termination signal for authentic t (25-27). 

The sequenator results show that at least a fraction of the 
protein bears an unmodified amino-terminal methionine, but 
we cannot exclude the possibility that some of the protein 
molecules have a modified amino terminus, t isolated from 
animal cells bears an acetylated amino-terminal methionine 
(24). It remains to be seen whether this secondary modification 
will affect the protein's activity. 

We are not certain of the level of t produced in our bacterial 
cells. Indirect estimates suggest that our best fusions produce 
roughly 1000-5000 monomers per cell. We have not system- 
atically varied the distance between the lac SD sequence and 
the ATG of t nor the sequence of the intervening bases. Such 
modifications might produce a strain that will yield significantly 
greater quantities of protein. 

Most of the reports of synthesis of higher eukaryotic proteins 
in bacteria have described the production of fusion proteins 
composed of both prokaryotic and eukaryotic sequences (28, 
29). In one case, it was suggested that the protein might be in- 
itiating at the correct ATG, but this was not directly demon- 
strated (30). Our experiments show that a message bearing a 
hybrid ribosome-binding site — i.e., sequences derived partly 
from the bacterium (the SD sequence) and partly from the 
eukaryotic gene (the ATG, etc.) — can be correctly translated 
into protein. This provides a rational approach to the problem 
of obtaining expression of eukaryotic genes in bacteria. 

We thank Daniela Faibes for expert assistance with a number of the 
experiments and Dr. Lan Bo Chen for help with two-dimensional 
electrophoresis. This work was supported by grants from the National 
Science Foundation and the National Institutes of Health to D.M.S. 
and M.P. 
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A gene sequence for the fowl plague virus {FPV) 
haemagglutinin molecule has been inserted into a bacterial 
plasmid such that its transcription is under the control of a 
promoter derived from the tryptophan operon. Such plas- 
mids direct the synthesis of a protein that reacts specifically 
with antisera to FPV haemagglutinin. Evidence is also 
presented that in some cases DMA inserted at the HmdlU 
site ofpBR322 is expressed. 



Influenza is still a major disease of man. It causes death 
primarily in the elderly, many of whom have chronic heart or 
vascular disease and other disorders. Severe infections of 
healthy individuals also occurs but in these cases morbidity, 
leading to incapacity to work, rather than mortality usually 
results. The morbidity figures however are of great economic 
importance. For example, the Hong Kong outbreak of 1968-70 
resulted in 25 million lost working days in England\ while in the 
US the effect of the Hong Kong variant of 1 968-69 was costed at 
3.8 billion dollars^ Thus, prevention of influenza epidemics and 
pandemics would be of value both economically and socially. 

The main structure involved in immunity against influenza is 
the haemagglutinin (HA) surface glycoprotein^-*. The functional 
HA subunit, one of the spikes on the virus surface, is a triangular 
rod-shaped glycoprotein with a molecular weight of ~ 250,000 
and is comprised of three HA monomers'. The HA monomers 
are synthesised as single polypeptide chains containing, in the 
case of FPV, an 18-amino acid precursor peptide at the N- 
terminus*. During maturation. and virus assembly the pre-HA is 
further processed to HAl and HA2 which remain linked by 
disulphide bridges'-^. 

Recent advances in genetic engineering and our knowledge of 
the structure and sequence of bacterial operons and control 
elements now allow the construction of new bacterial strains 
with the potential of synthesising large quantities of viral pro- 
teins (antigens) which may ultimately be useful as vaccines. In 
this article we describe the controlled production of haemag- 
glutinin antigen from an HA gene cloned in Escherichia coli ; this 
is the first step towards testing the feasibility of large scale 
antigen production by these means. 

Experimental design 

We have previously cloned at the Hindlll site of pBR322 a 
DNA fragment containing the control region of the E. coli 
tryptophan operon, and coding for the ribosome binding site and 
first seven amino acids of trpE. From this, we have produced a 
series of vectors, pWTlll, pWT121 and pWTlSl, capable of 
ensuring that any inserted DNA is read in the correct phase . 

We recently cloned and sequenced the gene for the FPV 
haemagglutinin protein*. The gene was cloned using H/ndlll 
linkers and can therefore be transferred between vectors. From 
the nucleotide sequences of the FPV HA gene and the Hindlll 
sites of the pWT series, it was clear that, if inserted at the Hindlll 
site of pWTl 2 1 in the correct orientation, the HA gene would be 
translated by readthrough from the trpE fragment (Fig. lA). 

The protein resulting from initiation at the trpE AUG would 
be a hybrid consisting of the following fragments in order: (1) an 



N-terminus of 7 amino acids from anthranilate synthetase; (2) 6 
amino acids specified by linker DNA; (3) 6 phenylalanine 
residues from the (T)ic, region of the FPV cloned DNA; (4) 7 
amino acids from the normally non-translated 5'-portion of the 
HA gene- (5) 558 amino acids comprising the haemagglutinm 
protein and its prepeptide* and, finally; (6) 5 amino acids 
specified by HmdIII linker at the C-terminus'. This is a total of 
589 amino acids with a total MW of 69,000. 

It was also clear that the HA gene should not be expressed 
from the initiator AUG of the trpE fragment in pWTl 11 or 
pWTl 3 1 . Recognition of the ribosome binding site and initiator 
AUG of the HA was not anticipated. Similarly, we did not 
expect expression of the HA gene inserted at the Hindlll site of 
pBR322 in either orientation. 

Construction of expression plasmids 

pWT121 was restricted with Hindlll, treated with alkaline 
phosphatase, ligated to the purified HA gene and the mixture 
used to transform E. coli K12 HBIOI. A total of 47 ampiciUin 
resistant transformants was obtained (pWT121 in the absence of 
the HA gene produced 13 colonies). Of these 47 colonies, 19 
were tetracycline sensitive (Tc^) and the remainder tetracycline 
resistant (Tc"*). We selected 12 colonies (3Tc' and 9 Tc ) for 
further characterisation. Analysis of plasmid DNA by gel elec- 
trophoresis after Hindlll restriction showed that 5 of the 9 Tc 
colonies contained plasmids with a DNA insert. The other four 
contained the parent plasmid. 

The orientation of the inserted DNA was determined by 
restriction enzyme analysis of the plasmids. We denote the two 
orientations as either R or L and define R and L orientation 
in terms of the direction of transcription required for HA 
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Fig. 1 A, Predicted nucleotide sequences around the Hindlll sites of 
pWTlll, pWT121 and the FPV-HA gene. Atiows indicate the position of 
the Hindlll sites and, tor clarity, only the sequence of the coding strand is 
shown. The protein sequence of the N-terminus of anthranilate synthetase 
has previously been reported by Lee et al^". The numbering system is 
bidirectional beginning with the 5'-nucleotide of the gene 4 complementary 
RNA. B, Potential base pairing between the 3'oligonucleotide of E. coli 16S 
RNA" and the pre-AUG region of the haemagglutinin mRNA. 
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FiE 2 A, Orientation of the FPV gene in pWTlll, 
p^l21 and PBR322. PBR322 pWTl 1 1 or 
□WT121 (10 HE) was limit digested with Hindlli. 1 tie 
resulting S'-terminal phosphates of the vectoi^ were 
removed by treatment with 20 ,ig bacterial alkahne 
phosphatase for 30 min at 37 =C in a 25 M-Hncubation 
containing 20 mM Tris-HCl pH 7.5 and 0.1% SDS^ 
After phenol extraction and ethanol precipitation 0.2 
Hg of each DNA was ligated to ~ 40 ng FPV;H A gene 
in a 20 p.1 incubation containing 50 mM lns-hn.,1 pH 
7 3 10 mM MgClj, 20 mM dithiothreitol, 1 mM ATP 
and 0 02 units T4 DNA ligase (New England Bioiabs). 
After overnight incubation at 15 'C the mixtures were 
diluted to 100 v\ with TCM (10 mM Tris-HCl pH 7.5, 
lOmM CaClj, 10 mM MgCy and used to transfect 
200 M.1 CaClj-treated E. coii K12 HBlOl by a pre- 
viously described procedure^'. Transformants present 
after growth for 16 h at 37 «C on L-agar , plus 
lOO^Lgml-' carbeniciUin (Pyopen) were tested for 
tetracycline resistance by picking onto agar plates 
containing M9 salts, glucose and casamino ac^s plus 
carbeniciUin and tetracycline (10 ngml )- ™ P'^*" 
mid isolation colonies were picked and lOO-inl 
cultures grown in M9 salts, glucose, casamino acids 
medium supplemented with carbeniciUin. At an Atm 
of -0 6 choramphenicol was added to a final 

concentration of 150 tig ml"' and incubation _ , ^^■i^™,^, 

iced 25% sucrose in 50mM Tris-HQ pH 8- ^ells were lysea oy intervals. The resulting lyi 
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^M Tris-HCl pH 8. Cells were lysed by the addition of 0.5 ml 'f "^y'^^^^^^™' d^^^^ by sp nning at IS.OOOg for 20 min 

solution (0 1% Triton X 100 50 mM Tris-HCl pH 8, 50 mM EDTA) 5;min mterv^- 
solution (0.1/. dthenchloroformandfinallypredp«atedbytheaddi iono 0.1vo^^ 



pelleted, washed with 70% ethanol, dried and dissolved !n TE. ^'^^^^^^^'^^"o^ KgesB werl fractionated on a 1 .4% agarose slab gel^ and bands visual sed 
Llabs)withthemodificationth3tRNase(50^grnlM^^^^^ 

under UV light after staining with ethidium bromide. Lane a conta^^^^^^^ 502(R); e, pWTtU/FPV 503(L), /, 

lanes contain PsH digests of the following plasm.ds: *, P™1/FPV 41KW^ i / ^ ^.^^jjj ^.^^ j„„„„ream from thejrp P™'"°'7,^ 

nRR-;22/FPV 604(U- ft PBR322/FPV 605(R). B, Structure of pWT121/l-FV 41 i(Kj. pw i li ,. between the PsrI and HindlU sites are shown as well as 
SaSo™^^^^^ 

the direction of transcription from the ,rp promoter and the P™"°" '^'^^i"'' region as well as the nucleotides specifying the leader sequence 
pub^ishedelse«he^e^Briefiy,the497-basepa.rH,.fIfragmentconan.nghe 

and first seven amino acids to trpE (ref . 10) was '^f'^^ and don i^^^^^^ P Elimination of the Hi«dIII site "P"'^""" 

^H^nrSotiTpr^^^^^^^ 

andpWT131. 



production. That is, a gene in the R orientation required 
rightward (clockwise) transcription for expression; in the pW 1 
plasmid series this is the orientation for expression from the 
trp promoter. For clarity we have included the letter IR) or (L) 
after the plasmid number to indicate its orientation pWi 121 
contains a single Pstl site, in the ampiciUin gene , 1,251 base 
pairs from the H.ndlll site and the FPV HA gene contains 2. Pstl 
site 1 178 base pairs from the 5' end of the coding strand . Thus 
Pstl digests of the recombinant plasmids should indicate which 
are correctly orientated with respect to the trp promoter. 
Representatives of the two FPV-gene containing groups are 
shown in Fig. 2A (b and c). TheTc^ plasmids gave rise to a band 
of 1 800 base pairs on Pstl digestion while the Tc plasmids 
produced one of 2,450 base pairs. These are consistent with the 
fragments expected from the plasmid shown in Fig. 2B and 
indicate that all the Tc' plasmids contained inserts in the L 
orientation while the Tc"* plasmids contained mserts in the R 
orientation. 

In a similar way we re-cloned the HA gene into the HtndlLl 
site of pWTlll and pBR322. Again the pWTlU/FPV plas- 
mids could be divided into Tc"' and Tc^ groups and PsA restric- 
tions (Fig. 2A, d and e) of representatives of these groups 
indicated that the HA gene was in the R orientation m the Tc 
group and in the L orientation m the Tc group. Ihe 
pBR322/FPV plasmids however were all Tc^ but still contained 
L and R orientated genes as shown in Fig. 2A (/ and g). 

The correct phasing of the HA gene in pWT121/FPV 411(R) 
with the trpE AUG was confirmed by nucleotide sequencmg. 
The HA gene contains an EcoRII restriction site at positions 
36-40 (ref 6). This site, however, is methylated and although 
not cleaved by EcoRH, it is cleaved by the £coRII isoschizomer 
BsfUl. Therefore we digested pWT121/FPV 411(R) with 
Bstm, labelled the 5' ends with ^^P, recut the DNA with Hpal 
and isolated the fragment containing the initiation region. The 
nucleotide sequence of this fragment was complementary to the 
sequence predicted in Fig. lA, except for nucleotide 'vhich 



was an A instead of a G. This was also the case for the 
corresponding position in pWTlll. In both plasmids this posi- 
tion corresponds to the 3'-terminal nucleotide of the H^ndlll 
linker The reason for the change is not clear; it might be due to 
random mutation or, more Ukely, to micro-heterogeneity or 
incomplete deblocking after chemical synthesis at the 3 ends ot 
the ffindlll linker. In either event the effect m the mRN A Strand 
is to have a serine codon (UCG) instead of a proline codon 
(CCG) but does not change the phase with respect to the trpt 
AUG. 

Gene expression 

E coli colonies containing representatives of the plasmids 
described above were screened for FPV-HA antigen usmg a 
solid-phase immunological method'^ Briefly, small cultures of 
individual colonies were grown, collected and lysed usmg 
lysozyme and Triton X-100. Any HA sequences present were 




Fia 3 Control of HA production. Overnight cultures of £. coli K12 HB 
101 containing pWT121/FPV 411(R) were grown in M9 salts, glucose, 
casamino acids medium supplemented with carbeniciUin (lOOngml ). 
Aliquots of 100 M.1 were inoculated into 5 ml of the above medium contain- 
ing either tryptophan or lAA as indicated. Cultures were grown for 2 h at 
37 °C, the A(,o„ determined and cells collected, lysed and assayed for HA 
content as described in Table 1. 
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table 1 Radioimmunoassay of FPV-HA in lysates of bacteria containing 
influenza genes 



Orientation 



Plasmid 

pWT121/FPV411 
p\VT121/FPV4i2 
pWTHl/FPV 502 
pWTUl/FPV503 
pBR322/FPV 604 
pBR322/FPV 605 
pWT121 
pBR322 





Phenotype 


HA content 




(ngp«50p.l) 


R 


Tc« 


20.3 


L 


Tc* 


5.0 


R 


Tc'^ 


4.1 


L 


Tc^ 


2.1 


L 


Tc^ 


3.6 


R 


Tc= 


0 




Tc" 


0 






0 



Liquid cultures (5 ml) of indivLdua! colonies were grown in M9 medium. Cells were collected 
by certrifugation, and lysed. in a final volume of 1 ml, as descrilied in Eg. 2A. This lysate was 
used for antigen assay without farther ireatment. HA-ansigen assays were performed m 
polystyrene culture tubes (Nunc no. 1410) using an antibody sandwich technique . Rabbit 
anii-FPV HA was purified by amraorium sulphate precipitation and DEAE-cellulose chroma- 
louaphy'" before use and labelled with '^^! as described elsewhere . Tubes were coated with 
50 M.I of 0.2M NaHCOjpH 9 containing 60 ti| mf ' purified, unlabelled IgG for 10 mm at room 
temperature, washed with 3 x 1-ml aliquotsof wa5hbuffer(phosphate-baffered sabnc containing 
0 1% bovine semm albumin, 0.1% NP-40 and 0.5% normal goal serum) and then incubated 
at room temperature either wish known amounts (1-20 ng) of Sarkosyi-disrupied FPV or with 
the bacterial lysates. After 2 h, the tubes were washed with 4 x l-ml aliquots of wash buffer and 
finally incubated at 4°C for 16 h with SO p.! wash buffer containing200,000 cp.m. I-anit-HA. 
Tubes were again washed and counted in a Packard y-counter. Antigen present was quantitaied 
from standard curves relating the radioactivity hound to the amount of FPV present. 



bound to a polystyrene tube coated with FPV-HA specific IgG. 
Bound antigen was then detected by incubating with high 
specific activity ^"l-anti FPV-HA. As indicated in Table 1, 
immune reactivity was detected from all colonies containing an 
FPV-HA gene inserted into the pWT plasmids and from 
pBR322 containing the HA gene in the L orientation. Neither of 
the parent plasmids nor pBR322 containing the HA gene in the 
R orientation produced any immune reacting material. Further, 
the positive reactions were aboUshed if the tubes were coated 
either with normal rabbit IgG or with anti A/ Victoria— HA IgG 
instead of specific anti-FPV-HA IgG. 

It was interesting to find expression of HA antigen from one 
orientation of pBR322/FPV. The Mndlll site of pBR322 lies in 
the promoter region of the tetracycline gene and it is known that 
cloning at the Hin&lW site destroys this rightward transcribing 
promoter^" unless the cloned DNA has its own promoter tran- 
scribing into the tetracycline gene(s)^\ We therefore propose 
that a previously unknown promoter which transcribes in the 
leftward direction exists on the tetracycline gene side of the 
Hwdlll site of pBR3 22 . It is necessary to postulate the existence 
of this promoter to explain the expression of the HA gene in 
pBR322/FPV 604(L), pWT121/FPV 412(L) and 
pWTUl/FPV 503(L). The possibility that there is a pseudo- 
promoter sequence in the HA gene itself, for example the 
{Ti9):(Ai9) sequence, is ruled out as the plasmid pBR322/FPV 
605(R) does not express its HA gene (Table 1). 

This postulated promoter would explain the transcription of 
the HA gene in the L-orientated plasmids. How is translation 
explained? Inspection of the nucleotide sequence around the 
tetracycline-HA gene junction of pBR322/FPV 604{L) reveals 
nonsense triplets in all three translation phases". Thus the 
bacterial translational system is probably recognising a nucleo- 
tide sequence on the HA mRNA itself and initiating protein 
synthesis at the AUG of the pre-peptide. Comparison of the 
sequence of the untranslated region of the HA gene with that of 
the 3' end of prokaryotic 16S ribosomal RNA'^ reveals surpris- 
ing complementarity (Fig. 1J5). Consistent with this inter- 
pretation is the expression of HA antigen in pWTlll/FPV 
502(R), In this case the HA gene is in the wrong phase to be 
translated from the trpE AUG; protein synthesis initiating at the 
trpE AUG is terminated by the now in phase UGA triplet at 
position 23-25 in the HA gene. Thus we conclude that initiation 
is from the natural HA AUG at position 22-24 (Fig. lA). 
Assuming equal transcription from the trp promoter in 
pWT121/FPV 411(R) and pWTlll/FPV 502(R) and that the 
products have equal antigenic properties, then from the data of- 
Table 1 we conclude that the eukaryotic ribosome binding site.is 



recognised with an efficiency of 20% compared to the prokary- 
otic site. 

The fivefold difference in HA expression between 
pWT121/FPV 411(R) and pWTlll/FJ'V 502(R) could be due 
to differences in the plasmid complement of the respective cells. 
To exclude this possibility the relative copy numbers of the 
above plasmids were determined from agarose gels of lysed 
whole cells' and found to be the same (data not shown). 

Hitherto the Mndlll and EcoKl sites of pBR322 have been 
assumed to be non-expression sites because of their respective 
situations in and upstream of the tetracycline promoter^'''^\ This 
has been thought to be of value in assessing the containment 
categorisation when considering the safety of experiments in 
genetic engineering^^. In view of the above findings it is clear 
that the validity of these assumptions should be reassessed. It 
seems likely that a potential ribosome binding site on the 
inserted gene will be of some importance. 

Table 2 Control of HA eipresion from the tryptophan promoter/operator region 







HA* 


Ptasmid 


Addition 


(ng per SO (li) 


pWT121/FPV411(R) 


Tryptophan 
lAA 


0.9 
S9.1 


pWT121/FPV 412(L) 


Tryptophan 


9.6 


lAA 


4.9 



Cultures (5 ml) of pWT121/FPV 411(R> and pWT121/FPV 412(L) were grown for 3 h al 
37 "C in M9 salts, glucose, casamino acids medium supplemented with carbentcillm 
(100 iig ml"') and either lAA (20 ml"') or iryptophan (100 m ml"')- Cells were collected, 
lysed and assayed for HA content as described in Table 1. 

* Results normalised 10 same Asm- 

Control of gene expression 

In vivo the tryptophan operon is controlled by the level of 
tryptophan^* and control is best obtained in culture by using 
tryptophan as repressor or j3 -indole acrylic acid (I A A) as 
inducer. The latter is a competitive inhibitor that prevents 
tryptophan binding and thereby effectively inactivates the 
repressor^*. 

We selected a representative colony from each of the two 
expressing groups (in pWT121) to examine HA synthesis in the 
above conditions. Our initial observation was that 
pWT121/FPV 411(R) grew very slowly in the presence of 
20 /ig ml"' lAA; there was a threefold difference compared to 
growth in the presence of tryptophan. pWT121/FPV 412{L) 
grew equally well in both sets of conditions. The results of 
radioimmunoassay of these cultures are shown in Table 2. 
Because of the differences in growth rate, the values are all 
normaUsed to the Asoo of the culture containing pWT121/FPV 
411(R) plus tryptophan. In pWT121/FPV 411(R), HA produc- 
tion is stimulated 65-fold when the trp operon is derepressed. In 
contrast, lAA decreased HA synthesis in pWT121/FPy 
412(L)..Thus increased transcription from the irp promoter is 
competing with transcription from the presumed promoter near 
or around the tetracycline gene and reducing expression in the 
leftward direction. 

The effects of repression and induction on HA synthesis are 
also seen in Fig. 3. As little as Sfjigml ' tryptophan was 
sufficient for maximal repression. On the other hand, because of 
its effect on growth, the induction characteristics of lAA were 
more complex. Maximal induction occurred at lAA concen- 
trations of 4-8 tig ml"' and HA synthesis was stimulated 120- 
fold over the repressed leveL 

Finally, we examined the immunoprecipitable products in 
cells containing various plasmids after induction or repression of 
trp transcription. At 3 h after addition of either lAA or tryp- 
tophan to cultures of such cells, aliquots were pulse-labelled 
with ^^S-methionine and the immunoprecipitable proteins 
separated by SDS-polyacryiamide gel electrophoresis. No 
immunoprecipitable proteins were present in induced cells 
containing pWT121 or in repressed cells containing 
pWT121/FPV 411(R) (Fig. 4a and b, respectively). However, 
induced cells containing pWT121/FPV 411(R) or 
pWTlll/FPV 502(R) and repressed cells containing 
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rig. 4 Immunoprecipitation o£ the haemagglutinin-like protein Cultures of K12 
Hll01con,ainbgtheplasmidspWT12i,pOTl21/FPV41UR),pWT121/FPV412U^^^^^ 

pWTlll/FPV 502{R) were grown in M9 salts, glucose, casammo acids medium rontaining 
carbeniciUin (100 ti.g mr') and either lAA (10 (ig m!" ') or tryptophan (40 ml ). After a 
rhperiod.5Ulsampleswerererriovedandpulse-labdl^^ 

"S-methionine. The labelling was terminated by the addition of 20 ml cold M9 medium, the 
cells pelleted by centriiugation at 12,000g for 10 min at 4 °C and iysed in a final vokime of 

1 ml, as described in Fig. 2 A. Immunoprecipitations were performed as follows : to 40O fil 
coli extracr was added 2 j.1 10% Nonidet P-40 (NP-10), 4 m-I (2M.g) normal rabbit IgG and 
the mixture incubated for 60 min at 20 "C. An aliquot of 10 m-I of a 10% suspension of killed 
Smpkylococc^ aureus'' (Cowan 1 strain) was then added and incubation continued for a 
further 15 min. The resulting immunoprecipitate was pelleted and discarded. To the 
supernatant was added 2 rabbit anti FPV-HA and, after 60 mm at 20 C IO m-I of S. 
aureus. This immunoprecipitate was pelleted 15 min later and washed twice w«h a solution 
containing 0.15MNaCl, 5mM EDTA, 50 mM Tris-HO pH 7.4 f^d 0.05% NP-40. 
Radioactive proteins were eluted from the complex with 30 h.1 SDS buffer, heated 90 C tor 

2 min and electrophoresed on a 12.5% polyacrylamide-SDS gel" which was then dried and 
autoradiographed. The figure shows the immunopreciptoble proteins from ce^on aining 
the following plasmids: a, pWT121 + lAA; b. pWT121/FPV 41 UR) + irp ; c, pWTl 21/FPV 
4n(R) + IAA-d,pWTlll/FPV502(R) + IAA;e,pWT121/FPV412(L) + (rp;/contamsan 

extract from Induced cells containing pWT121/FPV 411{R). The positions of conalbumin 
(MW 76,000), bovine serum albumin (69,000), ovalbumin (43,000) and carbonic anhydrase 
(30,000) are indicated by the arrows. 



pWT121/FPV 412(L) all produced a band of MW 61,000 (Fig. 
4c, d and e respectively). Our experimental design predicted 
that the HA-like protein synthesised from pWT121/FPV 
4U(R) would have a MW of 69,168 and that the products from 
pWT121/FPV 502(R) and pWT121/FPV 412(R) would be 26 
amino acids smaller at the N-terminus. As the three products are 
in fact the same size, it seems probable that the primary gene 
product has been processed and a portion of the polypeptide 
removed. The same results were obtained when the products of 
these plasmids were examined in minicells (unpublished results). 
Clearly the mechanism of the processing and the possible 
involvement of the pre-HA sequence need further investigation. 



Conclusions 

We have demonstrated the feasibility of producing controlled 
amounts of influenza antigenic determinants by genetic 
engineering. Obviously further analysis is necessary to charac- 
terise the protein product and a number of questions remain 
unanswered. First, for example, initiation of protein synthesis 
from the prokaryotic-like ribosome binding site on the HA will 
result in a polypeptide having a eukaryotic leader sequence. This 
sequence is thought to function in the transmembrane move- 
ment of the HA in vivo'^; we may question whether this 
sequence is recognised by the regulatory systems for export in 
bacteria. Second, what effect do the (T19) ; (A 19) region, resulting 
from the oligo(dT) primer and the poly(A)tail of the vRNA and 
situated between the AUG of trpE and the HA sequence, or the 
(Phe)6 oligopeptide resulting from its translation, have on tran- 
scription and translation respectively? Homopolymeric nucleo- 
tide regions such as this are one of the signals implicated in 
transcription termination^'; thus the (Ti9):(Ai9) region may 
have a polar effect on HA production. Similariy, the (Phe)6 
region may result in depletion of charged tRNAp^, and slow 
ribosome movement along the mRNA. Clearly examination of 
HA production by plasmids where this region has been removed 
is essential. 




The absolute quantity of HA-like protein produced may also 
be estimated. The estimates based on immunological detection 
cannot be precise and are probably minimum values since if 
there are differences from the natural antigen it seems likely that 
the bacterial product would have fewer antigenic determinants 
or lower affinity for the antiserum. Based on our maximum yield 
of 60 ng per 50 m.1 bacterial lysate (Table 2) and a protein 
concentration of 160 \Lg mr' for the same lysate we arrive at a 
figure of 0.75% of total protein. By comparison, the quantities 
of induced proteins can be determined by densitometry from 
autoradiographs of gels containing proteins from induced and 
repressed cells. This shows that the 61,000 MW protein is 2-3% 
of total protein synthesis (unpublished results). Neither of these 
values was as high as expected. Hallewell and Emtage^^- have 
recently cloned an E. coli Hindlll fragment containing the trp 
promoter and the complete trpE gene in pBR322; in induced 
conditions this plasmid was capable of specifying up to 30% of 
total protein as anthranilate synthetase^l Our values however 
are similar to those reported for the synthesis of ovalbumin-like 
proteins under the control of the lac promoter^*'^'; that is, the 
level of expression of the eukaryotic genes is only about 10% of 
the expected. Whether this is due to limiting amounts of some 
tRNA species as proposed by Eraser and Bruce'" or to the type 
of transcription termination discussed above remains to be seen. 

We thank Drs J. Oxford and M. Gethin for gifts of rabbit 
anti-FPV HA and Staphylococcus aureus respectively and Dr A. 
Hale for providing research facilities. Cloning and expression 
experiments were performed in a CIII laboratory as recom- 
mended by GMAG. 
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Communicated by Paul Doty, December 6, 1978 

ABSTRACT We present a method, utilizing a combination 
of restriction endonuclease cleavage and digestion with Esch- 
erichia coli exonuclease III and Aspergillus orizae nuclease SI, 
that allows us to position a restriction fragment bearing the 
promoter of the lacZ gene of E. coli at virtually any distance in 
front of any cloned gene. In particular, we have used this 
method to examine the effect on protein production of gene- 
promoter separation for the cro gene of phage X and to produce 
plasmids that, upon transformation into appropriate E. coli 
hosts, direct the synthesis of up to 190,000 cro protein monomers 
per cell. 

Previous reports from this laboratory have described the use 
of a DNA fragment generated by restriction endonuclease 
cleavage as a "portable promoter." This fragment bears the 
promoter of the lac operon of Escherichia coli and is capable 
of directing efficient transcription in vivo of genes to which it 
has been fused in t>i<ro. The fusions are carried on plasmids 
derived from pMB9 (1, 2). In the case of the cl gene of phage 
\, the amount of cl protein (X repressor) produced by a strain 
bearing a given cl-lac promoter fusion was a sensitive function 
of the gene-promoter separation. Strains bearing one of these 
plasmids, pKB280, direct the synthesis of over 30,000 monomers 
of cl protein per transformed cell, or roughly 150-foid more 
repressor than is found in a typical X lysogen. In pKB280 the 
DNA fragment containing the lac promoter was abutted to the 
end of the cl gene so that the ribosome binding site (see Dis- 
cussion) of the fusion was a hybrid of lac and X sequences. 
Fusions which placed the lac promoter at a considerably greater 
distance from the cl gene produced l/5th to 1/lOth the amount 
of cl protein. In these experiments, however, the actual number 
of fusions explored was limited by the availability of convenient 
restriction cuts in close proximity to the 5' terminus of cl. 

In this communication, we describe a method that, in prin- 
ciple, will allow the same lac promoter fragment to be placed 
at virtually any distance in front of a gene. The promoter 
fragment does not encode a translational start. It does, however, 
encode a sequence required for binding the message to the 
ribosome (see Discussion). Therefore, our gene-promoter fu- 
sions will produce a native protein rather than a fusion protein 
carrying foreign amino-terminal amino acids (3, 4). In partic- 
ular, we present details of the construction of a series of lac 
promoter fusions to the cro gene of X designed to examine 
systematically the effect on protein production of gene-pro- 
moter separation, and to produce a strain that synthesizes large 
amounts of cro protein. The best of these strains directs the 
synthesis of 190,000 monomers of cro protein per cell. 



The publication costs of this article were defrayed in part by page 
charge payment. This article must therefore be hereby marked "ad- 
vertisement" in accordance with 18 U. S. C. §1734 solely to indicate 
this fact. 



EXPERIMENTAL PLAN 

Our basic approach is to clone a gene in a plasmid such that a 
unique restriction endonuclease cleavage site is located near 
(within approximately 100 base pairs of) the 5' end of the gene. 
We then open the plasmid at that site and excise varying 
amounts of DNA with exonuclease III and the single-strand- 
specific endonuclease SI. We then insert a small DNA fragment 
bearing the promoter of the lac operon of E. coli and close the 
plasmid. This produces a set of plasmids bearing the promoter 
separated by varying distances from the gene. The specific 
procedure we used to construct the various lac promoter-cro 
gene fusions is diagrammed in Fig. 1. As indicated in the next 
paragraph, many of these steps might easily be modified to 
accommodate differing configurations of restriction enzyme 
cuts surrounding different genes. The essential features of the 
experiment of Figure 1 are as follows: (i) the cro gene was 
cloned: a DNA restriction fragment 550 base pairs long and 
bearing the cro gene was sheared to roughly 260 base pairs. 
Chemically synthesized Bam linkers (5) were ligated to the ends 
of this shorter fragment, and the resulting fragment was cloned 
into the Bam site of pBR322 (6) yielding pTR116. (ii) The Bam 
cut at the carboxy end of the cro gene in pTR116 was removed: 
the plasmid was partially digested with Bam; the resulting Bam 
sticky ends were rendered flush with E. coli DNA polymerase 
I and the four deoxynucleotide triphosphates (1). This left a 
single Bam cut in the resulting plasmid (pTR151) 54 bases from 
the amino terminus of the cro gene, (iii) Some or all of the 54 
base pairs of DNA between the Bam cut and the ATG signaling 
the start-point of translation were removed: pTR151 was cut 
with Bam and resected for various times with E. coli exonu- 
clease III with subsequent treatment with Aspergillus orizae 
nuclease SI to remove single-stranded tails, (ic) The plasmid 
was cut at the unique R I site some 375 base pairs upstream 
from the Bam site, and an R l-Alu restriction fragment con- 
taining the lac promoter was inserted into the plasmid back- 
bone. After transformation into E. coli, the resulting plasmids 
were characterized as described below. 

In theory, the cro gene in the above construction could have 
been any gene, and the Bam site could have been any unique 
restriction site. If the Bam site had been located inconveniently 
far from the beginning of the gene, it could have been moved 
closer by opening the plasmid with Bam, digesting with Exo 
III and SI, and then religating the resulting plasmid in the 
presence of an excess of Bam linkers. The R I site utilized in the 
cro construction can also be substituted for by several other 
restriction sites on pBR322 (e.g., Pst, Bam, Hin III, or Sal I). 
We wish to emphasize the convenience of the construction used: 
the most difficult step, the original cloning of the gene, is done 
once and then left unchanged. The only inserted fragment, the 
lac fragment, bears a lac operator. Cells bearing plasmids 
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Fig. 1. Schematic representation of the methods of plasmid construction. The approximate locations of several restriction endonuclease 
cleavase sites are shown for the plasmid pBR.'i22, for a DNA fragment hearing the era gene of phage A, and for a DNA fragment bearing the 
promoter of the lacZ gene [see Backman and Ptashne (2) for the source of this fragment). The location of the tet and lacZ gene promoters are 
indicated, as are the extent of the tet and cro genes. SD,. and SDi indicate the Shine-Dalgarno sequences of the cro and lacZ genes, respectively. 
AUG and UAA are the start and stop signals for translation of the cro protein. Distances are indicated in base pairs. Steps: (i) The fragment 
bearing the cro gene was shortened by shearing to remove certain A control elements near the 3' end of the gene, and the smaller fragment was 
inserted into the Ram site in pBR322 by using Ram linkers. ((; ) The Ram site near the carboxy terminus of the cro gene was eliminated. [Hi) 
The plasmid was opened at the Ram site and varying amounts of DNA were removed by Exo III and SI. (iv) The partially resected plasmid 
was cut at the R I site, and the lac promoter (bearing the UV5 mutation rendering it independent of catabolite activator) was inserted by "sticky 
end" ligation at its R I end and by "blunt end" ligation to the resected plasmid DNA at its Alu end. The efficiency of steps Hi and iu are fairly 
high — about 200-400 plasmids result from 1 Mg of pTR151 input. 
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carrying the operator are easily recognized on indicator plates. 
Plasmids bearing the lac operator cause colonies of cells to turn 
blue on the appropriate indicator plate, because multiple copies 
of the operator titrate the lac repressor resulting in synthesis of 
i8-galactosidase (see ref. 1). 

MATERIALS AND METHODS 

Strains. E. colt strain 294 (Endo I, ri(~mi("'"Bi~pro~) was used 
as the host for all plasmids. Cells were grown in TB medium 
supplemented with 0.1% yeast extract (7). Phage strains XKH54 
(8), X4V'S (Va V305 Vse Vgase) (9), and Xc411 [selected from 
4V'S by growth on strains producing large amounts of cro (J. 
Eliason, unpublished results)] were used for cross-streaking. 

Enzymes. T4 ligase and E. coli DNA polymerase I were used 
as described in Backman et al. (1). Bgl II restrictions were 
carried out in 6.6 mM Tris-HCl, pH 7.4/6.6 mM MgCl2/6.6 
mM 2-mercaptoethanol. Bam, R I, and Hae III digestions were 
carried out in the same buffer plus 60 mM NaCl. Exonuclease 
III (10) digestions were carried out at 22.5°C in the same buffer 
used for Bom digestion in 25-50-m1 reaction mixtures at a DNA 
concentration of 100 iig/rcA by using eight units of exonuclease 
III per ^Lg of DNA. Under these conditions Exo III digests at a 
rate of 8-10 base pairs per min per end depending on the batch 
of enzyme. Exonuclease III digestion was halted by addition 
of an equal volume of two times concentrated SI buffer: 100 
mM NaOAc-HOAC, pH 4.0/300 mM NaCl/12 mM ZnS04. 
SI nuclease was added to a concentration of 75 units per 50 /il 
and allowed to digest for 2 hr at 18°C (11). The reaction was 
stopped by phenol extraction, and the DNA was purified over 
a G-50 fine Sephadex column before further reaction. 

Plasmid Constructions. pTRllG. The 550-base-pair Hin II 
restriction fragment (5 ^g) bearing the cro gene was dialyzed 
against 100 mM NaOAc, pH 8.0/1 mM EDTA/66.7% glycerol 
in a final volume of 20 ml. The DNA was sheared for 30 min 
at 40,000 rpm in a VirTis homogenizer. A dry ice/isopropanol 
bath was used for cooling. After ethanol precipitation of the 
DNA, polymerase 1 was used to fill in the single-stranded re- 
gions generated by shearing (providing the single-stranded 
projection had a 5' end). Bam linkers (5) were then ligated onto 
ends of the shear fragments (75 pmol of linkers in a total volume 
of 20 ti\). After digestion with Bam, the fragments (now bearing 
Bam sticky ends) were cloned into the Bam site of pBR322. 
Transformed cells were selected for ampicillin resistance and 
X immunity (1). DNA was purified (12) from a small number 
of candidates, and the sequence of the inserted Bam piece from 
the plasmid with the smallest insert (pTR116) was determined 
(13). The insert extended from the Hin II site at the left end of 
the original restriction fragment to a point five bases past the 
3' end of the cro gene (data not shown). The shearing procedure 
thus neatly separated the cro gene from certain X control ele- 
ments near the carboxy terminus of the gene that we wished 
to exclude from our future constructions. 

pTR151. pTRlie DNA (5 fig) was partially digested with 
Bam, and those DNA molecules receiving only one Bam cut 
were separated from the other species present by agarose gel 
electrophoresis. After extraction from the gel (14), this DNA 
was treated with DNA polymerase I and the four deoxynucle- 
otide triphosphates to render flush the Bam sticky ends (l). The 
resulting molecules were circularized with T4 ligase and used 
to transform E. coli. DNA was isolated from several ampicil- 
lin-resistant, X-immune colonies, and restriction analysis was 
carried out to determine which of the product plasmids had lost 
the desired Bam site (data not shown). 

pTRiei . pTRlSl DNA (5 fig) was opened with Bam. DNA 
pwlymerase 1 was used to fill in the resulting sticky ends. The 
DNA was then cut with R I and the R I-A/u promoter fragment 



(in 3-fold molar excess) was ligated into place. All plasmids were 
constructed under conditions conforming to the standards 
outlined in the National Institutes of Health guidehnes. 

Radioimmunoassay, cro protein was measured in lysates of 
transformed cells by radioimmunoassay using the procedure 
of A. Johnson (unpublished results). Radioimmunoassay mea- 
surements were quite precise — there was, on the average, less 
than 5% difference between duplicate measurements on a 
culture. The amount of cro protein was found to be a function 
of cell growth conditions. However, measurements made on 
separate occasions on cultures of independent transformants 
bearing a given plasmid differed by less than 35%, and the ratio 
of cro protein present in strains bearing two different plasmids 
varied less than 10% from day to day. Gel electrophoresis of 
total soluble protein from strains bearing plasmid pTR214 gave 
estimates of cro protein as a percent soluble protein in accord 
with the radioimmunoassay determination (data not shown) 
— this indicates that the majority of the cro protein detected 
by radioimmunoassay is intact. 

DNA Sequence Analysis. DNA from each of the various 
X-lac fusion plasmids was digested with Bgl II, which cleaves 
at a unique site 67 bases downstream from the amino terminus 
of the cro gene. The resulting linear DNA was 3'-end labeled 
by using DNA polymerase I and [a-^^P) dCTP and [a-^^P] 
dATP (15). Subsequent R I digestion released a small DNA 
fragment, 162-220 base pairs long, bearing the region of fusion 
of lac and X sequences. This fragment was isolated by polyac- 
rylamide gel electrophoresis, and the sequence through the 
fusion region was determined by the method of Maxam and 
Gilbert (13). 

RESULTS 

pTRlSl DNA was opened with Bam, resected with exonuclease 
III for times ranging from 30 sec to five min, and then treated 
with SI nuclease (16). An R l-Alu fragment bearing the lac 
promoter was ligated into place. After transformation, cell 
cultures were plated on ampicillin selective plates containing 
the indicator 5-chloro-4-bromo-3-indolyl-|3-D-galactoside. Blue 
colonies, arising from cells that bear the lac promoter on the 
transforming plasmid and are thus lac constitutive, were then 
checked for cro protein production by cross-streaking against 
phage X. A typical experiment yielded 400 colonies per ng of 
pTR151 DNA, 70% of which are blue. Of these blue colonies, 
10-90% were immune. Our assumption, later confirmed by 
radioimmunoassay, was that increasingly high levels of cro 
protein production would confer resistance to ever more viru- 
lent derivatives of X. Clones were tested for immunity to phages 
bearing no operator mutations (XKH54) or four or more oper- 
ator mutations (X4V'S, Xc411). In this way, the transformants 
were divided into categories by level of immunity. DNA was 
isolated from some 40 transformants, representative of the 
various classes, and characterized by digestion with Hae III (see 
Fig. 2). Because the entire sequence of pBR322 is known (17) 
as well as the sequences of the lac promoter fragment (18) and 
the cro gene itself (19), the Hae digests were readily interpreted. 
The size of the deletion made by exonuclease III and SI nu- 
clease treatment was estimated from the size of the fragment 
containing the lac-cro fusion. 

Finally, nine representative transformants were chosen for 
more detailed analysis. The level of cro protein produced by 
each of these clones was measured by radioimmunoassay, and 
the DNA sequence across the lac-cro fusion was determined. 
Fig. 3 shows the results of this analysis. 
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Fig. 2. Analysis of Hae III digests of plasmid DNA by gel elec- 
trophoresis. Plasmid DNAs from the various cm protein-producing 
strains were partially purified and digested with Hae III. Digests were 
electrophoretically separated on 8% polyacrylamide gels. Of the Hae 
III fragments produced from any plasmid, all but one fragment are 
common to all plasmids. The one varying fragment contains the region 
of fusion between lac and X sequences. It can be sized by comparison 
to the other fragments, the whole sequences of which are all known. 
The lengths in base pairs of four of these pBR322 fragments are in- 
dicated. 



DISCUSSION 

Strains bearing two of our fusions, pTR213 and pTR214, are 
rather spectacular overproducers of cro protein. The levels of 
cro protein as a percent of total soluble protein shown in Fig. 
3 of 1.6% and 1.0% correspond to roughly 190,000 and 120,000 
monomers of cro per cell. We know of no other E. coli strains 
which produce any protein to a significantly greater extent on 
a molar basis. If there are 50 copies of pTR213 per cell [the copy 
number of pBR322 (6)], each lac promoter must be directing 
the production of 4000 monomers of cro. This is roughly the 
fully induced level of expression of j3-galactosidase from the 
same promoter when it directs transcription of the lac operon 
(2). We have not excluded the possibility that these fusions have 



acquired additional mutations that change the number of copies 
of pliasmid maintained in the cell. The high frequency with 
which fusions are produced that direct the synthesis of large 
amounts of protein argues strongly against this possibility, 
however. 

The lac-cro fusions presented here represent the beginning 
of a systematic examination of the effect of gene-promoter 
separation on protein production. The nine plasmids described 
in Fig. 3 all carry a complete cro gene and a complete lac pro- 
moter, separated by varying distances. However, the variation 
in the amount of cro protein produced by strains bearing the 
different fusion plasmids is enormous. For example, strains 
carrying pTR213 produce over 2000-fold more cro protein than 
strains carrying pTRlQO. Current theory offers plausible ex- 
planations for some of the observed differences, but fails to 
explain all of them. The same promoter is being used in each 
plasmid, and so we assume that transcription across the cro gene 
is uniform in each case, and therefore the differences in protein 
productions are due to differences in some post-transcriptional 
process. The cro mRNAs transcribed from the various plasmids 
differ in their leader regions, and these differences might affect 
mRNA stability or processing or ribosome binding efficiency. 
We do not know how the presence of different leaders might 
influence the stability or processing of a message. However, 
current theory holds that a ribosome binding site on a prokar- 
yotic mRNA is composed of two parts: the AUG or GUG sig- 
naling the start-point of translation and the so-called Shine and 
Dalgarno sequence — 3-9 bases in the leader portion of the 
message that are complementary to bases at the 3' end of the 
168 rRNA (21, 22). In six of the nine plasmids, the bases coding 
for the cro Shine-Dalgarno sequence are intact (pTR161, 213, 
199, 214, 188, and 194), and it seems reasonable to assume that 
this sequence is functioning in ribosome binding. Although 
relatively high levels of cro protein are produced in cells bearing 
all of these plasmids, there are striking differences among them. 
For example, the deletion in pTR199 is only three bases longer 
than that in pTR213 and five bases shorter than that in pTR214; 
yet, strains transformed with pTR199 direct synthesis of 1/lOth 
of the protein of either of the others. In pTR210 one of the bases 
of the cro Shine-Dalgamo sequence has been removed, and yet 
strains carrying this plasmid produce 10-fold higher levels of 
cro protein than strains carrying pTR199. We have no expla- 
nation for these differences. In the two remaining plasmids, the 
bases coding for the cro Shine-Dalgarno sequence are either 



|AATT6T6AGCG6ATAACAATTTCACACEG6AjftACAG6ATCCG66ACTATTTTACCTATG6C66TGATAAT6GTTGCATGTAC[rAA66A66Tfr6TAT6 
TTAACACTCGCCTATTGTTAAAGTGTGtrCniTTGTCCTAGGCCCTGATAAAATGGATACCGCCACTATTACCAACGTACATG lATTCCTCCAl ACATAC 



pTR213 
pTR199 
PTR214 
PTR188 
pTR194 
PTR210 
pTR182 
pTR190 



FlC. 3. A summary of the deletion size and cro pr<xiuction of selected plasmids. Shown is a portion of the sequence of pTR161 extending from 
the start-point of transcription of the lac promoter on the left (straight arrow) to the start-point of translation of the cro gene on the right (wavy 
arrow). All plasmids produced by the£xo III-Sl method described in Fig. 1 may be considered deletionsof pTR161. The extent of these deletions 
is indicated by brackets. The numbers on the brackets are the level of cro protein as a percentage of total soluble protein in cells transformed 
with each of the deletion plasmids. For comparison, cro protein represents 0.5% of the total soluble protein in cells transformed with pTR161, 
and a previously reported plasmid (20) in which cro transcription is initiated at its own promoter directs synthesis of 0.05% cro. By cross-streaking, 
strains bearing pTR182 and pTR190 are immune to XKH54; and strains bearing pTR213, 214, 188, 194, and 210 are immune to AKH54, X4V"S, 
and to Xc411, a X strain selected from X4V"S for growth on E. coli strains producing large amounts of cro. 
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largely (pTR182) or totally (pTRlQO) absent. Cells transformed 
with these plasmids produce extremely small amounts of cro 
protein. This is perhaps understandable in light of the absence 
of the cro Shine-Dalgarno sequences; however, it must be re- 
membered that presimiptive ribosome binding sites in pTR182 
and pTR190 are actually coded for by a fusion of lac and X se- 
quences. The lac promoter fragment used in the construction 
of all the plasmids described here contains, in addition to the 
binding site for RNA polymerase and the start-point of tran- 
scription, the coding region for most of the leader of the lacZ 
message including its Shine-Dalgarno sequence. Thus, in 
pTR182 and pTR190, the lac Shine-Dalgarno sequence is 
brought into close proximity with the ATG of cro. We know 
that such hybrid ribosome binding sites can be extremely ef- 
fective from the results with pKB280 (2). It is possible that the 
separation of this sequence and the ATG in pTR190 and 
pTR182 is not optimum — the lac sequence is separated from 
the ATG of cro by 10 bases in pTR182 and by 5 bases in 
pTRlQO, whereas the same sequence is separated from the ATG 
of the cl gene by 8 bases in pKB280 and is separated from the 
ATG of the lacZ gene by 7 bases in the E. coli chromosome. 
Work is in progress to produce a lac-cro fusion in which the lac 
Shine-Dalgarno sequence is seven bases from the ATG of 
cro. 

The method of exonuclease III and SI nuclease digestion used 
here should allow the placement of the promoter-containing 
fragment at virtually any distance upstream from most other 
genes. As we mentioned earlier, the requirements for restriction 
sites in and around other genes of interest are extremely flexible. 
Perhaps the only limitation in promoter placement suggested 
by the sequence data in Fig. 3 is a tendency of the deletion to 
stop at As or Gs in preference to Ts in the 5' (top) strand (there 
are not enough Cs available to make a judgment). In particular, 
the technique should be useful in positioning the lac promoter 
fragment adjacent to a eukaryotic gene with the hope of 
forming a hybrid ribosome binding site from the lac Shine- 
Dalgarno sequence and the ATG of the eukaryotic gene. Ex- 
pression of the eukaryotic gene could be monitored by ra- 
dioimmune techniques (23, 24). It remains to be seen whether 
such fusions will produce eukaryotic proteins in E. coli. 

We thank A. Johnson for purified cro protein and anti-cro serum, 
J. Eliason for phage stocks, A. Jeffrey for restriction enzymes Hae III 
and R I, W. McClure for E. coli exonuclease III and polymerase I, and 
R. Scheller for Bam linkers. 
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PROTEIN SYNTHESIS 

This is a continuation of application Ser. No. 3,102, 
filed Jan. 15, 1979, and now abandoned. 5 

The invention described herein was made in the 
course of work under a grant or award from the Na- 
tional Science Foundation. 

This invention is a method for producing in bacteria 
prokaryotic or eukaryotic proteins in native, unfused 10 
form free from extraneous peptides. 

Recombinant DNA techniques in vitro have been 
used to insert a variety of eukaryotic genes into plas- 
mids carried by Escherichia coli in an effort to induce 
these bacteria to produce eukaryotic proteins. Most of 15 
these genes have not directed the synthesis of the native 
proteins because the eukaryotic signals coding for initia- 
tion of transcription and/or translation do not function 
well in E. coli. One proposed solution to this problem 
has been the fusion of the eukaryotic gene with a bacte- 20 
rial gene. The process results in the production of a 
hybrid protein, a portion of which at its carboxyl termi- 
nus is constituted by the eukaryotic protein. In one case, 
it has been possible to separate a small biologically 
active protein from a fusion product (Itakura, K. et al., 25 
Science 198, 1056 (1977)). 

Gene expression takes place by transcription into 
mRNA then translation into protein. To do these opera- 
tions, the DNA preceding the gene must have a se- 
quence which; (a) directs efficient binding of bacterial 30 
RNA polymerase and efficient initiation of transcrip- 
tion, and (b) codes for a mRNA that directs efficient 
binding of mRNA to the ribosomes and initiation of 
translation into protein. 

The present invention provides a method of produc- 35 
ing native, unfused prokaryotic or eukaryotic protein in 
bacteria which comprises inserting into a bacterial plas- 
mid a gene for a prokaryotic or eukaryotic protein and 
a portable promoter consisting of a DNA fragment 
containing a transcription initiation site recognized by 40 
RNA polymerase and containing no protein transla- 
tional start site, said promoter being inserted ahead of a 
protein translational start site of said gene to form a 
fused gene having a hybrid ribosome binding site, in- 
serting said plasmid into said bacteria to transform said 45 
bacteria with said plasmid containing said fused gene, 
and culturing the transformed bacteria to produce said 
prokaryotic or eukaryotic protein. 

The present invention utilizes nucleases, restriction 
enzymes, and DNA ligase to position a portable pro- 50 
moter consisting of a DNA fragment containing a tran- 
scription site but no translation initiation site near the 
beginning of the gene which codes for the desired pro- 
tein to form a hybrid ribosomal binding site. The pro- 
tein produced by the bacterium from this hybrid is the 55 
native derivative of the implanted gene. It has been 
found that the endonuclease digestion product of the E. 
coli lac operon, a fragment of DNA which contains a 
transcription initiation site but no translational start site, 
has the required properties to function as a portable 60 
promoter in the present invention, being transcribed at 
high efficiency by bacterial RNA polymerase. The 
mRNA produced contains a Shine-Dalgarno (S-D) 
Sequence but it does not include the AUG or GUG 
required for translational initiation. However, in accor- 65 
dance with the present invention, a hybrid ribosomal 
binding site is formed consisting of the S-D sequence 
and initiator from the lac operon and the ATG sequence 
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of the gene, and such a fused gene is translated and 
transcribed efficiently. Using the enzymes exonuclease 
III and SI, the promoter may be put at any desired 
position in front of the translational start site of the gene 
in order to obtain optimum production of protein. Since 
the promoter can be inserted at a restriction site ahead 
of the translational start site of the gene, the gene can 
first be cut at the restriction site, the desired number of 
base pairs and any single stranded tails can be removed 
by treating with nucleases for the appropriate time per- 
iod, and religating. 

The following specific example is intended to illus- 
trate more fully the nature of the present invention 
without acting as a limitation upon its scope. 

EXAMPLE 

A rabbit ;S-globin gene was first cloned into the Hin 
III site of pBR322, a plasmid of the E. coli bacteria, via 
restriction enzyme cuts of the initial DNA, reconstitu- 
tion of the gene by T4 ligase, insertion of the reconsti- 
tuted gene into the Hin III site using chemically synthe- 
sized Hin III linkers, and religating with DNA ligase. 

The Hin III cut at the carboxyl end of the cloned 
gene was removed by partially digesting with Hin III, 
filling in the resulting Hin III "sticky ends" with E. coli 
DNA polymerase I, and religating with T4 ligase. This 
left in the resulting plasmid a single Hin III cut 25 base 
pairs ahead of the amino terminus of the globin gene. 

Differing number of the 25 base pairs between the 
Hin III cut and the ATG signalling the start point of 
translation were removed from different samples of the 
cloned gene as follows: the plasmid was cut with Hin 
III, resected for various times from 0.5 to 10 minutes 
with Exo III, then treated with SI to remove single- 
stranded tails. 

The portable promoter of the lac operon, an Rl-Alu 
restriction fragment of E. co// DNA, was then inserted 
by treating each sample of the plasmid with Rl which 
cuts at a unique site some 30 base pairs upstream from 
the Hin III site, and the portable promoter was insertec 
into the plasmid backbone at this site. This requires one 
"sticky end" and one "flush" end, both of which are 
ligated by the same treatment with ligase. 

Colonies of E. co/z each containing one of these result- 
ing plasmids were then screened for yS-globin produc- 
tion using RIA-screening techniques to identify the one 
or more producing ;3-globin. 

The globin gene in the above construction can be any 
gene coding for prokaryotic or eukaryotic proteins, and 
any other unique restriction site can be employed in 
place of the Hin III site. If the restriction site is located 
inconveniently far from the beginning of the gene, it 
may be moved (for example, a Hin III site may be 
moved by opening the plasmid with Hin III, digesting 
with Exo III and SI, then religating the resulting plas- 
mid in the presence of excess Hin III linkers). Any 
suitable restriction site can be employed for insertion of 
the portable promoter in place of the Rl site (e.g. Pst, 
BAM, or Sal I). Finally, it should be emphasized that 
the most difficult step, the cloning of the gene into the 
plasmid, is done once and left unchanged. The promoter 
fragment will confer its constitutive expression on the 
cell so it is easy to screen for the intact promoters. 
What is claimed is: 

1. The method of producing native, unfused prokary- 
otic or eukaryotic protein in bacteria which comprises 
inserting into a bacterial plasmid a gene for a prokary- 
otic or eukaryotic protein and a portable promoter 
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consisting of a DNA fragment containing a Shine-Dal- 
garno sequence and a transcription initiation site recog- 
nized by RNA polymerase and containing no protein 
translational start site, said promoter being inserted 
upstream from a protein ATG translational start site of 
said gene, at a position to obtain production of said 
protein, to form a fused gene having said Shine-Dal- 
garno sequence, said transcription initiation site, and the 
ATG signalling the start point of translation, inserting 
said plasmid into said bacteria to transform said bacteria 
with said plasmid containing said fused gene, and cul- 
turing the transformed bacteria to produce said unfused 
prokaryotic or eukaryotic protein. 

2. The method as claimed in claim 1 in which said 
bacteria is E. coli. 
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3. The method as claimed in claims 1 or 2 in which 
said portable promoter is the product of restriction 
endonuclease digestion of an operon. 

4. The method claimed in claim 2 in which said porta- 
ble promoter is the product of restriction endonuclease 
digestion of the lac operon of E. coli. 

5. The method as claimed in claims 1 or 2 in which 
said gene insertion comprises cloning said gene into said 
plasmid, adjusting the spacing between said inserted 

) gene and a preceding unique restriction site by treating 
with a nuclease, and cloning said portable promoter into 
said restriction site. 

6. The method as claimed in claim 2 in which said 
gene insertion comprises cloning said gene into said 

) plasmid, adjusting the spacing between said gene and a 
preceding unique restriction site by treating with a nu- 
clease, and cloning into said restriction site a portable 
promoter formed by the endonuclease digestion of the 
lac operon of E. coli. 
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[57] ABSTRACT 

This invention is a process to produce specific proteins 
coded for by eukaryotic (or prokaryotic) DNA in bac- 
teria. The invention, which uses recombinant DNA 
techniques, produces proteins in their natural, func- 
tional state unencumbered by extraneous peptides. 
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FUSED HYBRID GENE 

This is a division, of application Ser. No. 111,101, 
filed Jan. 10, 1980, now U.S. Pat. Not. 4,332,892, which 5 
is a continuation of Ser. No. 3,102 filed Jan. 15, 1979 
now abandoned. 

The invention described herein was made in the 
course of work under a grant or award from the Na- 
tional Science Foundation. 10 

This invention is a method for producing in bacteria 
prokaryoctic or eukaryotic proteins in native, unfused 
form free from extraneous peptides. 

Recombinant DNA techniques in vitro have been 
used to insert a variety of eukaryotic genes into plas- 15 
mids carried by Escherichia coli in an effort to induce 
these bacteria to produce eukaryotic proteins. Most of 
these genes have not directed the synthesis of the native 
proteins because the eukaryotic signals coding for initia- 
tion of transcription and/or translation do not function 20 
well in E. coli. One proposed solution to this problem 
has been the fusion of the eukaryotic gene with a bacte- 
rial gene. The process results in the production of a 
hybrid protein, a portion of which at its carboxyl termi- 
nus is constituted by the eukaryotic protein. In one case, 25 
it has been possible to separate a small biologically 
active protein from a fusion product (Itakura, K. et al., 
Science 198, 1956 (1977)). 

Gene expression takes place by transcription into 
niRNA then translation into protein. To do these opera- 30 
tions, the DNA preceding the gene must have a se- 
quence which: (a) directs efficient binding of bacterial 
RNA polymerase and efficient initiation of transcrip- 
tion, and (b) codes for a mRNA that directs efficient 
binding of mRNA to the ribosomes and initiation of 35 
translation into protein. 

The present invention provides a method of produc- 
ing native, unfused prokaryolic or eukaryotic protein in 
bacteria which comprises inserting into a bacterial plas- 
mid a gene for a prokaryotic or eukaryotic protein and 40 
a portable promoter consisting of a DNA fragment 
containing a transcription initiation site recognized by 
RNA polymerase and containing no protein transla- 
tional start site, said promoter being inserted ahead of a 
protein translational start site of said gene to form a 45 
fused gene having a hybrid ribosome binding site, in- 
serting said plasmid into said bacteria to transform said 
bacteria with said plasmid containing said fused gene, 
and culluring the transformed bacteria to produce said 
prokaryotic or eukaryotic protein. 50 

The present invention utilizes nucleases, restriction 
enzymes, and DNA ligase to position a portable pro- 
moter consisting of a DNA fragment containing a tran- 
scription site but no translation initiation site near the 
beginning of the gene which codes for the desired pro- 55 
tein to form a hybrid ribosomal binding site. The pro- 
tein produced by the bacterium from this hybrid is the 
native derivative of the implanted gene. It has been 
found that the endonuclease digestion product of the E. 
coli lac operon, a fragment of DNA which contains a 60 
transcription initiation site but no transladonal start site, 
has the required properties to function as a portable 
promoter in the present invention, being transcribed at 
high efficiency by bacterial RNA polymerase. The 
mRNA produced contains a (Shine-Dalgarno (S-D) 65 
Sequence) but it does not include the AUG or CUG 
required for translational initiation. However, in accor- 
dance with the present invention, a hybrid ribosomol 
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binding site is formed consisting of the S-D sequence 
and initiator from the lac operon and the ATG sequence 
of the gene, and such a fused gene is translated and 
transcribed efficiently. Using the enzymes exonuclease 
111 and SI, the promoter may be put at any desired 
position in front of the translational start site of the gene 
in order to obtain optimum production of protein. Since 
the promoter can be inserted at a restriction site ahead 
of the translational start site of the gene, the gene can 
first be cut at the restriction site, the desired number of 
base pairs and any single standard tails can be removed 
by treating with nucleases for the appropriate time per- 
iod, and religating. 

The following specific example is intended to illus- 
trate more fully the nature of the present invention 
without acting as a limitation upon its scope. 

EXAMPLE 

A rabbit y8-globin gene was first cloned into the Hin 
III site of pBR322, a plasmid of the E. coli bacteria, via 
restriction enzyme cuts of the initial DNA, reconstitu- 
tion of the gene by T4 ligase, insertion of the reconsti- 
tuted gene into the Hin III site using chemically synthe- 
sized Hin III linkers, and rehgaring with DNA ligase. 

The Hin III cut at the carboxyl end of the cloned 
gene was removed by partially digesting with Hin III, 
filling in the resulting Hin III "sticky ends" with E. coli 
DNA polymerase I, and religating with T4 ligase. This 
left in the resulting plasmid a single Hin 111 cut 25 base 
pairs ahead of the amino terminus of the globin gene. 

Differing numbers of the 25 base pairs between the 
Hin III cut and the ATG signalling the start point of 
translation were removed from different samples of the 
cloned gene as follows: the plasmid was cut with Hin 
III, resected for various times from 0.5 to 10 minutes 
with Exo III, then treated with SI to remove single- 
standard tails. 

The portable promoter of the lac operon, an Rl-Alu 
restriction fragment of E. coli DNA, was then inserted 
by treating each sample of the plasmid with Rl which 
cuts at a unique site some 30 base pairs upstream from 
the Hin III site, and the portable promoter was inserted 
into the plasmid backbone at this site. This requires one 
"sticky end" and one "flush" end, both of which are 
ligated by the same treatment with ligase. 

Colonies of E. coli each containing one of these result- 
ing plasmids were then screened for ;8-globin produc- 
tion using RIA-screening techniques to identify the one 
or more producing /3-globin. 

The globin gene in the above construction can be any 
gene coding for prokaryotic or eukaryotic proteins, and 
any other unique restriction site can be employed in 
place of the Hin III site. If the restriction site is located 
inconveniently far from the beginning of the gene, it 
may be moved (for example, a Hin III site may be 
moved by opening the plasmid with Hin III, digesting 
with Exo III and SI, then religating the resulting plas- 
mid in the presence of excess Hin III linkers). Any 
suitable restriction site can be employed for insertion of 
the portable promoter in place of the Rl site (e.g. Pst, 
BAM, or Sal I). Finally, it should be emphasized that 
the most difficult step, the cloning of the gene into the 
plasmid, is done once and left unchanged. The promoter 
fragment will confer its constitutive expression on the 
cell so it is easy to screen for the intact promoters. 

What is claimed is: 

1. A fused hybrid gene capable of expressing native 
unfused prokaryotic or eukaryotic protein consisting 
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essentially of (1) a portable promoter including a por- 
tion of a bacterial gene having a Shine-Dalgarno se- 
quence and a transcription initiation site recognized by 
RNA polymerase and containing no protein transla- 
tional start site, and (2) fused thereto a gene for a native 
unfused prokaryotic or eukaryotic protein including its 
translational start site, said portable promoter being 
located upstream from said translational start site. 

2. A fused gene as claimed in claim 1 in which said 
promoter is the product of restriction endonuclease 
digestion of an operon. 

3. A fused gene as claimed in claim 2 in which said 
bacterial gene is that of E. coli and said promoter is the 
product of restriction endonuclease digestion of the lac 
operon of E. coli. 
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4. A fused gene as claimed in claim 1 wherein said 
translational start site of said gene for said prokaryotic 
or eukaryotic protein is the sequence ATG. 

5. A fused gene as claimed in claim 1 wherein said 
5 portable promoter and said translational start site of said 

prokaryotic or eukaryotic gene together comprise a 
hybrid ribosomal binding site. 

6. A fused gene as claimed in claim 1 wherein said 
gene for a prokaryotic or eukaryotic protein is a gene 

10 for a eukaryotic protein. 

7. A bacterium containing the fused gene of claim 1, 
said bacterium being capable of producing said prokary- 
otic or said eukaryotic protein. 

8. A bacterium containing the fused gene of claim 6, 
15 said bacterium being capable of producing said eukary- 
otic protein. 
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